SIMULTANEOUS SIMILARITY AND TRIANGULARIZATION 
OF SETS OF 2 BY 2 MATRICES

Abstract. Let $\mathcal{A} = (A_1, \ldots, A_n, \ldots)$ be a finite or infinite sequence of $2 \times 2$ matrices with entries in an integral domain. We show that, except for a very special case, $\mathcal{A}$ is (simultaneously) triangularizable if and only if all pairs $(A_j, A_k)$ are triangularizable, for $1 \leq j, k \leq \infty$. We also provide a simple numerical criterion for triangularization.

Using constructive methods in invariant theory, we define a map (with the minimal number of invariants) that distinguishes simultaneous similarity classes for non-commutative sequences over a field of characteristic $\neq 2$. We also describe canonical forms for sequences of $2 \times 2$ matrices over algebraically closed fields, and give a method for finding sequences with a given set of invariants.

1. Introduction



1.1. Motivation. The properties of a set of square matrices which are invariant under simultaneous conjugation have been the subject of many investigations. In the case of a pair of matrices, many problems have been solved, including finding criteria for simultaneous similarity, simultaneous triangularization, existence of common eigenvectors, etc. Analogous problems have been solved for subalgebras, subgroups or sub-semigroups of square matrices (see, for example, [Fr], [L], [RR] and references therein).

Here, we mainly concentrate on the problems of simultaneous similarity, simultaneous 
triangularization and canonical forms of countable sets of 2 x 2 matrices with entries in an 
arbitrary field, with the focus on effectively computable solutions (the triangularization 
results will hold over integral domains). 
Even though we are concerned with $2 \times 2$ matrices, it is convenient to describe the setup more generally as follows. Fix integers $m \geq 1$, $n \geq 1$ and let $V_{m,n}$ be the vector space of $n$-tuples of $m \times m$ matrices with entries in a field $\mathbb{F}$. The group of invertible matrices $G := GL_m(\mathbb{F})$ acts on $V_{m,n}$ by simultaneous conjugation on every component:

(1.1) $g \cdot \mathcal{A} := (g A_1 g^{-1}, \ldots, g A_n g^{-1}),$

where $g \in G$ and $\mathcal{A} = (A_1, \ldots, A_n) \in V_{m,n}$. By analogy with the case $n = 1$, the orbit $G \cdot \mathcal{A} := \{g \cdot \mathcal{A} : g \in G\} \subset V_{m,n}$ will be called the conjugacy class of $\mathcal{A}$, and can be viewed as an element $[\mathcal{A}] \in V_{m,n}/G$ of the quotient space. One can consider the following problems.

(i) Identify one element for each conjugacy class in a natural way, i.e., list all possible 'canonical forms';
(ii) Construct invariants that distinguish all conjugacy classes. 
Naturally, these problems are related. In general, a solution to (i) will lead to a solution 
of (ii); however, the answer obtained in this way might be unnatural, and the description 
in terms of invariants is often useful. 

These are difficult questions in general. Here, we will be concerned with the description 
and classification of conjugacy classes in the case m = 2. As we will see, in this case 
the questions above admit simple and complete answers, given by explicitly computable 
numerical criteria. In order to state our results in concise terms (see Definition 1.4), we
will adopt the following terminology for (ordered) sets of matrices.



Key words and phrases. Simultaneous similarity, Simultaneous triangularization, Invariants of 2 by 2 matrices.

Definition 1.1. An element $\mathcal{A} = (A_1, \ldots, A_n) \in V_{m,n}$ will be called a matrix sequence of size $m \times m$ and length $n$, or just a matrix sequence when the integers $m$ and $n$ are assumed, and the matrices $A_1, \ldots, A_n$ will be called the components or terms of $\mathcal{A}$. We say that two matrix sequences $\mathcal{A} = (A_1, \ldots, A_n)$ and $\mathcal{B} = (B_1, \ldots, B_n)$ in $V_{m,n}$ are similar and write $\mathcal{A} \sim \mathcal{B}$ if they lie in the same conjugacy class (or $G$ orbit), i.e., if there is an element $g \in G$ such that $\mathcal{B} = g \cdot \mathcal{A}$ (equivalently, $B_j = g A_j g^{-1}$ for all $j = 1, \ldots, n$).

We will also allow sequences $\mathcal{A} = (A_1, \ldots, A_n, \ldots) \in V_{m,\infty}$ with a countably infinite number of terms.

It turns out that a concrete answer to the above problems requires the separation of 
all matrix sequences (of fixed size and length) into G-invariant subclasses with distinct 
algebraic and/or geometric properties. For instance, for the class of matrices that pairwise 
commute some of these questions are easier. This case can be reduced to the case of a 
single matrix (n = 1) which is classically solved with the Jordan decomposition theorem 
(at least when F is algebraically closed and some term is nonderogatory) . 

One property that is often relevant is that of simultaneous triangularization (or block triangularization); it generalizes commutativity and has a natural geometric interpretation. To be precise, let us consider the following standard notions. Note that a sequence $\mathcal{A} \in V_{m,n}$ can be viewed either as an ordered set of $m \times m$ matrices, as in the definition above, or alternatively, as a single $m \times m$ matrix whose entries are $n$-dimensional vectors:

(1.2) $\mathcal{A} = (a_{jk})_{1 \leq j,k \leq m}, \qquad a_{jk} \in \mathbb{F}^n.$



Definition 1.2. Let $\mathcal{A}$ be a matrix sequence. $\mathcal{A}$ will be called commutative if all its terms pairwise commute (and non-commutative otherwise). We will say that $\mathcal{A}$ is an upper triangular matrix sequence if all the vectors below the main diagonal are zero ($a_{jk} = 0$ for all $j > k$ in equation (1.2)), and that $\mathcal{A}$ is triangularizable if it is similar to an upper triangular matrix sequence.

In geometric terms, $\mathcal{A}$ is triangularizable if and only if there is a full flag of vector subspaces of $\mathbb{F}^m$ which is invariant under every term $A_j$ of $\mathcal{A}$ (for the standard action of $G$ on $\mathbb{F}^m$). Observe also that upper and lower triangularization are equivalent (over any ring). For the reasons mentioned, we will add the following triangularization problem to the previous list.

(iii) Give an effective numerical criterion for a matrix sequence to be triangularizable. 
Problem (iii) was solved effectively by the following refinement of McCoy's Theorem ([Mc]).
Let [A, B] = AB — BA denote the commutator of two matrices. 

Theorem 1.3. [McCoy, see [Mc], [L]] Let $\mathbb{F}$ be algebraically closed. An $m \times m$ matrix sequence $\mathcal{A}$ of length $n$ is triangularizable if and only if

$p(A_1, \ldots, A_n)\,[A, A']$

is nilpotent for all monomials $p(x)$ (in non-commuting indeterminates $x_1, \ldots, x_n$) of total degree not greater than $m^2$ and all terms $A, A'$ of $\mathcal{A}$.

To discuss simultaneous similarity, consider the notion of subsequence.

Definition 1.4. A subsequence of a matrix sequence $\mathcal{A} = (A_1, \ldots, A_n) \in V_{m,n}$ is a matrix sequence of the form

$\mathcal{A}_J = (A_{j_1}, \ldots, A_{j_l})$

obtained from $\mathcal{A}$ by deleting some of its terms (here $J = (j_1, \ldots, j_l) \in \{1, \ldots, n\}^l$ for some natural number $l \leq n$, and the indices are strictly increasing: $1 \leq j_1 < \cdots < j_l \leq n$).

Let $\mathbb{A} = \mathbb{F}[A_1, \ldots, A_n]$ be the algebra generated over $\mathbb{F}$ by the terms of $\mathcal{A}$ (of dimension $\leq m^2$). Suppose we rearrange the terms of $\mathcal{A}$ such that $\mathbb{A}$ is in fact generated by just the first $k \leq m^2$ elements. Then, for all $j > k$, $A_j = p_j(A_1, \ldots, A_k)$ for some polynomial $p_j$ with coefficients in $\mathbb{F}$. From this one can easily obtain a similarity test.

Fact 1.5. Let $\mathcal{A}, \mathcal{B}$ be two $m \times m$ matrix sequences of the same finite or infinite length $n$. Then $\mathcal{A}$ and $\mathcal{B}$ are similar if and only if all corresponding subsequences of length $\leq m^2$ are similar (that is, $\mathcal{A}_J \sim \mathcal{B}_J$ for all $J = (j_1, \ldots, j_l)$ with $1 \leq j_1 < \cdots < j_l \leq n$ and $l \leq m^2$).

One can improve this statement if we restrict to a big class of matrix sequences. The Dubnov-Ivanov-Nagata-Higman [DINH] theorem states that, given $m \in \mathbb{N}$, there is a natural number $N(m)$ with the following property: every associative algebra over $\mathbb{F}$ satisfying the identity $x^m = 0$ is nilpotent of degree $N(m)$ (i.e., any product $x_1 \cdots x_k$ of $k$ elements $x_1, \ldots, x_k$ of the algebra is zero, for $k \geq N(m)$). It is known that $\frac{m(m+1)}{2} \leq N(m) \leq \min\{2^m - 1, m^2\}$ [Ra], and it was conjectured that $N(m) = \frac{m(m+1)}{2}$. This was verified to be true for $m = 2, 3, 4$.

The following remarkable result of Procesi ([Pr]) relates invariants of matrices and nilpotency degrees of associative nil-algebras. It provides explicit generators for the algebra of $G$-invariant regular (i.e., polynomial) functions on $V_{m,n}$. For a multiindex $J = (j_1, \ldots, j_k) \in \{1, \ldots, n\}^k$ of length $|J| = k$, define the $G$-invariant regular function $t_J : V_{m,n}(\mathbb{F}) \to \mathbb{F}$ as the trace of the word in the terms of $\mathcal{A} \in V_{m,n}(\mathbb{F})$ dictated by $J$, that is

(1.3) $t_J(\mathcal{A}) := \mathrm{tr}(A_{j_1} \cdots A_{j_k}).$

Theorem 1.6. [Procesi, [Pr]] Let $f : V_{m,n}(\mathbb{F}) \to \mathbb{F}$ be a $G$-invariant polynomial function. Then $f$ is a polynomial in the set of functions

$\{t_J : |J| \leq N(m)\},$

where $|J|$ denotes the length of the multiindex $J$.

Let us say that a matrix sequence $\mathcal{A}$ is semisimple if its $G$ orbit is Zariski closed in $V_{m,n}$. Semisimple sequences form a dense subset of $V_{m,n}$. Noting that $N(2) = 3$, Procesi's theorem has the following corollary.

Corollary 1.7. Let $\mathcal{A}, \mathcal{B}$ be two $2 \times 2$ semisimple matrix sequences of the same finite or infinite length. Then $\mathcal{A} \sim \mathcal{B}$ if and only if all corresponding subsequences of length $\leq 3$ are similar (that is, $\mathcal{A}_J \sim \mathcal{B}_J$ for all $J$ with length $\leq 3$).

This corollary gives a numerical criterion for simultaneous similarity of semisimple sequences. In the case $m = 2$, the number of generators can be further reduced, and moreover, Drensky described all relations in terms of a generating set ([D]). He states his main theorem for traceless matrices, but an easy modification yields:

Theorem 1.8. [Drensky, [D]] The generators of the algebra (over $\mathbb{F}$ of characteristic 0) of $G$-invariant polynomial functions on $V_{2,n}$ are given by:

$t_j, \quad t_{jj}, \quad u_{jk}, \quad s_{jkl}, \qquad 1 \leq j < k < l \leq n,$

where $u_{jk} := 2t_{jk} - t_j t_k$, $s_{jkl} := t_{jkl} - t_{lkj}$ and $t_j, t_{jk}, t_{jkl}$ are as in (1.3). A full set of relations is

(1.4)
$s_{abc}\, s_{def} + \frac{1}{4} \begin{vmatrix} u_{ad} & u_{ae} & u_{af} \\ u_{bd} & u_{be} & u_{bf} \\ u_{cd} & u_{ce} & u_{cf} \end{vmatrix} = 0,$
$u_{ea} s_{bcd} - u_{eb} s_{acd} + u_{ec} s_{abd} - u_{ed} s_{abc} = 0.$

In this article, we show that these same generators can be used to get even more explicit solutions to problems (i)-(iii) in the case $m = 2$. Our methods are mostly elementary and their generalization to higher $m$ seems possible, although computationally very involved.
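As a sanity check, the trace functions $t_J$ and the combinations $u_{jk}$, $s_{jkl}$ can be evaluated on concrete integer matrices. The sketch below is only an illustration: the helper names (`t`, `u`, `s`, `linear_relation`, `gram_relation`) are ours, not taken from [D], and the random test matrices are arbitrary. It tests the four-term linear relation among the $u$ and $s$, together with a Gram-determinant identity written in cleared-denominator form, $4 s_{abc} s_{def} + \det(u) = 0$.

```python
import random

def mul(A, B):
    """Product of two 2 x 2 matrices stored as nested lists."""
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def t(seq, *J):
    """t_J: trace of the word A_{j_1} ... A_{j_k} (indices are 1-based)."""
    W = [[1, 0], [0, 1]]
    for j in J:
        W = mul(W, seq[j - 1])
    return W[0][0] + W[1][1]

def u(seq, j, k):
    return 2 * t(seq, j, k) - t(seq, j) * t(seq, k)

def s(seq, j, k, l):
    return t(seq, j, k, l) - t(seq, l, k, j)

def det3(M):
    return (M[0][0] * (M[1][1] * M[2][2] - M[1][2] * M[2][1])
            - M[0][1] * (M[1][0] * M[2][2] - M[1][2] * M[2][0])
            + M[0][2] * (M[1][0] * M[2][1] - M[1][1] * M[2][0]))

def linear_relation(seq, a, b, c, d, e):
    """Left-hand side of u_ea s_bcd - u_eb s_acd + u_ec s_abd - u_ed s_abc = 0."""
    return (u(seq, e, a) * s(seq, b, c, d) - u(seq, e, b) * s(seq, a, c, d)
            + u(seq, e, c) * s(seq, a, b, d) - u(seq, e, d) * s(seq, a, b, c))

def gram_relation(seq, a, b, c, d, e, f):
    """Left-hand side of 4 s_abc s_def + det(u_ij) = 0, i in (a,b,c), j in (d,e,f)."""
    gram = [[u(seq, i, j) for j in (d, e, f)] for i in (a, b, c)]
    return 4 * s(seq, a, b, c) * s(seq, d, e, f) + det3(gram)

random.seed(0)  # arbitrary seed, used only for reproducibility
seq = [[[random.randint(-5, 5) for _ in range(2)] for _ in range(2)]
       for _ in range(6)]
```

Both functions should return zero for any choice of indices, since the identities are polynomial identities in the matrix entries.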

1.2. Statement of the main results. From now on, we restrict to the space $V_n = V_{2,n}$ of sequences of $2 \times 2$ matrices of length $n \in \mathbb{N} \cup \{\infty\}$, i.e., the case $m = 2$, except where explicitly stated.

The article can be roughly divided into two parts. In the first part, Section 2, we work with the space $V_n(R)$ of $2 \times 2$ matrix sequences with coefficients in an integral domain $R$, on which the group $G = GL_2(R)$ of invertible matrices over $R$ acts by conjugation. Here, we define and study reduced sequences, consider the triangularization problem, and prove Theorems 1.11 to 1.13 stated below. They provide an efficient numerical criterion for triangularization of matrix sequences in $V_n(R)$.

The case m = 2 is simple enough that some easy arguments already improve some of the 
statements above, even in the more general case of 2 x 2 matrices over integral domains. 

Proposition 1.9. $\mathcal{A} \in V_n(R)$ is triangularizable if and only if every subsequence of length $\leq 3$ is triangularizable.

For simultaneous similarity, a simple argument generalizes and improves Corollary 1.7 in the case $m = 2$, to account for all matrix sequences (not necessarily semisimple).

Proposition 1.10. Over an integral domain, $\mathcal{A} \sim \mathcal{B}$ if and only if $\mathcal{A}_J \sim \mathcal{B}_J$ for all $J$ of length $\leq 3$. Moreover, under the generic condition $[A_1, A_2] \neq 0$, $\mathcal{A}$ is similar to $\mathcal{B}$ if and only if $(A_1, A_2, A_j) \sim (B_1, B_2, B_j)$, for all $j = 3, \ldots, n$.

In other words, the conjugation action of $GL_2(R)$ on $2 \times 2$ matrix sequences of any length is completely determined by the same action on triples of $2 \times 2$ matrices. These two propositions seem to be standard, and I thank R. Guralnick for furnishing simple arguments leading to their proofs, included below for convenience and completeness.

The statement of Proposition 1.9 is the best possible in this generality, as there are $2 \times 2$ matrix sequences of length 3 (therefore, a fortiori for every size $m \times m$, $m \geq 2$ and every length $n \geq 3$) that are not triangularizable but are pairwise triangularizable (see Example 2.11 below). See however Theorem 1.12 below.

Let us now consider the problem of finding a numerical criterion for triangularization of general $2 \times 2$ sequences. By Proposition 1.9 we just need to consider triples of $2 \times 2$ matrices. Using only the $G$-invariant functions given by the trace and the determinant, the following is a simple numerical criterion.

Theorem 1.11. A $2 \times 2$ matrix sequence $\mathcal{A}$ (over $R$) of length $n$ is triangularizable if and only if all its terms are triangularizable and

(1.5) $\det(AB - BA) = \mathrm{tr}(ABC - CBA) = 0$

for all terms $A, B, C$ of $\mathcal{A}$. In particular, a pair $(A, B) \in V_2(R)$ is triangularizable if and only if both $A$ and $B$ are triangularizable and $\det(AB - BA) = 0$.

Remark. Over an algebraically closed field $\bar{\mathbb{F}}$, every single matrix is triangularizable. Suppose $\mathcal{A}$ is a $2 \times 2$ matrix sequence over $\bar{\mathbb{F}}$ with $[A, B]$ and $C[A, B]$ nilpotent for all terms $A, B, C$ of $\mathcal{A}$. Then Equation (1.5) holds and Theorem 1.11 shows that all mentioned bounds in Theorem 1.3 can be improved for $m = 2$, as the condition that $p$ is a monomial of degree $\leq 1$ is already sufficient (and necessary) for triangularization.
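The numerical condition (1.5) is straightforward to test by machine. The following is a minimal sketch for matrices with integer entries, with function names of our own choosing. It checks only the two identities in (1.5); over a general integral domain one must additionally verify that each single term is triangularizable, which is automatic over an algebraically closed field.

```python
from itertools import product

def mul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def sub(A, B):
    return [[A[i][j] - B[i][j] for j in range(2)] for i in range(2)]

def tr(A):
    return A[0][0] + A[1][1]

def det(A):
    return A[0][0] * A[1][1] - A[0][1] * A[1][0]

def satisfies_criterion(seq):
    """Check det(AB - BA) = 0 and tr(ABC - CBA) = 0 for all terms A, B, C."""
    for A, B in product(seq, repeat=2):
        if det(sub(mul(A, B), mul(B, A))) != 0:
            return False
    for A, B, C in product(seq, repeat=3):
        if tr(sub(mul(mul(A, B), C), mul(mul(C, B), A))) != 0:
            return False
    return True

# An upper triangular sequence, conjugated by g = [[2, 1], [1, 1]] (det g = 1).
g, g_inv = [[2, 1], [1, 1]], [[1, -1], [-1, 2]]
upper = [[[1, 2], [0, 3]], [[4, 5], [0, 6]], [[7, 8], [0, 9]]]
conjugated = [mul(mul(g, A), g_inv) for A in upper]

# The pair (E12, E21) has det[E12, E21] = -1, so it fails the criterion.
pair = [[[0, 1], [0, 0]], [[0, 0], [1, 0]]]
```

Here `satisfies_criterion(conjugated)` holds because the conditions are invariant under conjugation, while `satisfies_criterion(pair)` fails already at the commutator determinant.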

Theorem 1.11 generalizes a result proved in [Fl] for algebraically closed fields. Observe that the case $n = 2$ of this theorem is a direct generalization to integral domains of a well-known criterion, obtained in [Fr], for a pair of $2 \times 2$ matrices over $\bar{\mathbb{F}}$ to be triangularizable.

As a consequence of our study of invariants, we can improve Proposition 1.9 under a simple non-degeneracy condition on $\mathcal{A}$. Let us define a reduced sequence to be one with no commuting pairs among its terms.

Theorem 1.12. A $2 \times 2$ reduced matrix sequence $\mathcal{A}$ of length $\geq 4$ is triangularizable (over $R$) if and only if all subsequences of $\mathcal{A}$ of length $\leq 2$ are triangularizable (over $R$).

With a little more care we get a test, computationally much more efficient, whose complexity grows only linearly with the number $n$ of matrices in $\mathcal{A}$. Let us define the reduced length of a sequence $\mathcal{A}$ to be the biggest length of a reduced subsequence $\mathcal{B} \subset \mathcal{A}$.

Theorem 1.13. Let $\mathcal{A} = (A_1, \ldots, A_n) \in V_n$ have reduced length $l \leq n$ and rearrange its terms so that $\mathcal{A}' = (A_1, \ldots, A_l)$ is reduced. Then

(i) In the case $l \leq 3$, $\mathcal{A}$ is triangularizable if and only if $\mathcal{A}'$ is triangularizable.

(ii) In the case $l \geq 4$, $\mathcal{A}$ is triangularizable if and only if $A_1, \ldots, A_l$ are triangularizable and $\det([A_j, A_k]) = 0$ for all $j = 1, 2, 3$ and all $j < k \leq l$.
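To make the linear growth in part (ii) concrete, here is a hedged sketch of the test for a sequence that is already known to be reduced, of length at least 4, and whose single terms are each triangularizable (for instance, over an algebraically closed field). The function names are hypothetical and integer entries are assumed.

```python
def commutator_det(A, B):
    """det(AB - BA) for 2 x 2 matrices given as nested lists."""
    AB = [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
          for i in range(2)]
    BA = [[sum(B[i][k] * A[k][j] for k in range(2)) for j in range(2)]
          for i in range(2)]
    C = [[AB[i][j] - BA[i][j] for j in range(2)] for i in range(2)]
    return C[0][0] * C[1][1] - C[0][1] * C[1][0]

def reduced_is_triangularizable(seq):
    """Condition of Theorem 1.13 (ii): det[A_j, A_k] = 0 for j = 1, 2, 3 and j < k <= l.

    Assumes (without checking) that seq is reduced, has length l >= 4, and that
    every single term is triangularizable.
    """
    l = len(seq)
    if l < 4:
        raise ValueError("this sketch only covers reduced length >= 4")
    return all(commutator_det(seq[j], seq[k]) == 0
               for j in range(3) for k in range(j + 1, l))

def number_of_checks(l):
    """Number of determinants evaluated: (l-1) + (l-2) + (l-3) = 3l - 6."""
    return sum(l - 1 - j for j in range(3))

# Reduced upper triangular sequence of length 5: A_j = [[j, 1], [0, 0]].
good = [[[j, 1], [0, 0]] for j in range(1, 6)]
# Replacing the tail by a lower triangular nilpotent matrix breaks the condition.
bad = good[:3] + [[[0, 0], [1, 0]]]
```

The count $3l - 6$ is to be compared with the $\binom{l}{2}$ pairs and $\binom{l}{3}$ triples that a direct application of (1.5) would inspect.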

In the second part of the article, we work mainly over a field $\mathbb{F}$. Section 3 deals with simultaneous similarity for $2 \times 2$ matrix sequences. Using standard invariant theory, one sees that the values of the Drensky generators are enough to distinguish semisimple conjugacy classes. But these classes should depend on $4n - 3$ parameters only, the "dimension" of the quotient space $V_n/G$ ($V_n$ has dimension $4n$, and $G$ acts as $PGL_2(\mathbb{F})$, a three dimensional group), which is much less than the number, $2n + \binom{n}{2} + \binom{n}{3}$, of Drensky generators. After describing rational invariants that distinguish general triangularizable sequences, we obtain a solution, with the minimal number of invariants, to problem (ii) for non-commutative sequences as follows.

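Let $\mathcal{S}'$ (resp. $\mathcal{U}'$) denote the subsets of $V_n = V_n(\mathbb{F})$ of semisimple (resp. triangularizable) sequences such that $A_1$ is diagonalizable and $[A_1, A_2] \neq 0$. Using the $G$-invariant functions $t_J : V_n \to \mathbb{F}$ in (1.3), define the maps $\Phi' : \mathcal{S}'/G \to \mathbb{F}^{4n-3}$ and $\Psi' : \mathcal{U}'/G \to \mathbb{F}^{2n} \times \mathbb{P}^{n-2}$, where $\mathbb{P}^k$ denotes the projective space over $\mathbb{F}$ of dimension $k$, by the formulae

(1.6) $\Phi'([\mathcal{A}]) = (t_1, t_{11}, t_2, t_{22}, t_{12}, \ldots, t_k, t_{1k}, t_{2k}, s_{12k}, \ldots, t_n, t_{1n}, t_{2n}, s_{12n}),$

(1.7) $\Psi'([\mathcal{A}]) = (t_1, t_{11}, t_2, t_{12}, \ldots, t_k, t_{1k}, \ldots, t_n, t_{1n}; \psi'),$

where $\psi'$ is defined later in Section 3 and $[\mathcal{A}]$ denotes the conjugacy class of $\mathcal{A}$.

Theorem 1.14. Let $\mathbb{F}$ be a field of characteristic $\neq 2$. The map $\Phi' : \mathcal{S}'/G \to \mathbb{F}^{4n-3}$ is injective and the map $\Psi' : \mathcal{U}'/G \to \mathbb{F}^{2n} \times \mathbb{P}^{n-2}$ is two-to-one.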
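The tuple of invariants in (1.6) can be computed directly from traces of words of length at most 3. The sketch below uses our own function names (`phi`, `drensky_count`) and an arbitrary integer example. It evaluates the tuple, confirms that it has $4n - 3$ entries, and compares that with the number of Drensky generators; it also checks on one example that the tuple is unchanged by a conjugation.

```python
from math import comb

def mul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def t(seq, *J):
    """Trace of the word A_{j_1} ... A_{j_k} (1-based indices)."""
    W = [[1, 0], [0, 1]]
    for j in J:
        W = mul(W, seq[j - 1])
    return W[0][0] + W[1][1]

def phi(seq):
    """Tuple (t1, t11, t2, t22, t12, ..., tk, t1k, t2k, s12k, ...) as in (1.6)."""
    out = [t(seq, 1), t(seq, 1, 1), t(seq, 2), t(seq, 2, 2), t(seq, 1, 2)]
    for k in range(3, len(seq) + 1):
        out += [t(seq, k), t(seq, 1, k), t(seq, 2, k),
                t(seq, 1, 2, k) - t(seq, k, 2, 1)]   # s_12k = t_12k - t_k21
    return tuple(out)

def drensky_count(n):
    """Number 2n + C(n,2) + C(n,3) of generators t_j, t_jj, u_jk, s_jkl."""
    return 2 * n + comb(n, 2) + comb(n, 3)

# Arbitrary example with A_1 diagonal and [A_1, A_2] != 0.
seq = [[[1, 0], [0, 2]], [[1, 1], [1, 0]], [[0, 3], [1, 1]], [[2, 1], [0, 1]]]
g, g_inv = [[2, 1], [1, 1]], [[1, -1], [-1, 2]]
conj = [mul(mul(g, A), g_inv) for A in seq]
```

For $n = 4$ the tuple has 13 entries, against 18 Drensky generators, and the gap widens cubically in $n$.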

The last section (Section 4) concerns the classification of canonical forms for sets of $2 \times 2$ matrices over $\bar{\mathbb{F}}$ and proposes a solution for problem (i). The main result is Theorem 4.3, where five types of canonical forms are obtained for sequences with at least one non-commuting pair. It shows that, for non-commutative sequences, the restriction to $\mathcal{S}'$ and $\mathcal{U}'$ in Theorem 1.14 is only apparent (see also Remark 4.6). We also describe a simple method for finding a sequence in canonical form with a given value of $\Phi'$ or $\Psi'$.

Appendix A contains results on the triangularization of a single $2 \times 2$ matrix over $R$ which are crucial for Theorem 1.11, and Appendix B, for completeness, describes the well-known canonical forms of commuting matrices over $\bar{\mathbb{F}}$.

and motivating conversations on this and related topics. Work partially supported by the CAMGSD, IST, Technical Univ. of Lisbon, and "Fundação para a Ciência e a Tecnologia" through the programs Praxis XXI, POCI/MAT/58549/2004 and FEDER. Some computations performed using CoCoA; document typeset using LyX.

2. Simultaneous Triangularization

Throughout the article, $R$ will stand for an integral domain, $\mathbb{F}$ for a field and $\bar{\mathbb{F}}$ for an algebraically closed field. $V_n$ (resp. $G$) will denote the space of matrix sequences of length $n \in \mathbb{N} \cup \{\infty\}$ (resp. the group of invertible $2 \times 2$ matrices) over the appropriate ring or field. When the coefficients need to be explicitly mentioned, we will use the notations $V_n(R)$, $G(R)$, etc. Sequences of length 1, 2, 3 and 4 will be called singlets, pairs, triples and quadruples, respectively.

2.1. Simultaneous triangularization and subtriples; reduced sequences. We start by fixing notation and recalling some well known facts about matrices over $\mathbb{F}$ and $R$. After this, we define reduced sequences, a notion which will be fundamental in the sequel. For a given $\mathcal{A} \in V_n(R)$, a matrix sequence of length $n \in \mathbb{N}$, we will use the notation

$\mathcal{A} = (A_1, \ldots, A_n) = \begin{pmatrix} a & b \\ c & d \end{pmatrix}, \quad a, b, c, d \in R^n, \qquad A_j = \begin{pmatrix} a_j & b_j \\ c_j & d_j \end{pmatrix},$

and we let $e = (e_1, \ldots, e_n)$ denote the $n$-tuple $a - d \in R^n$. To avoid the most trivial case, we consider only matrix sequences with at least one non-scalar term.
The commutator of 2 matrices $A_1$ and $A_2$ is given by

(2.1) $[A_1, A_2] = \begin{pmatrix} b_1c_2 - c_1b_2 & e_1b_2 - b_1e_2 \\ c_1e_2 - e_1c_2 & c_1b_2 - b_1c_2 \end{pmatrix}.$

For later use, record the following straightforward but useful lemma.

Lemma 2.1. Let $\mathcal{A} = (A_1, A_2) \in V_2(R)$ and $A_1$ be a non-scalar matrix. If $A_1$ is upper triangular, then $[A_1, A_2] = 0$ if and only if $A_2$ is also upper triangular and

(2.2) $b_1e_2 - e_1b_2 = 0.$

Similarly, let $A_1$ be diagonal non-scalar. Then $[A_1, A_2] = 0$ if and only if $A_2$ is also diagonal.

Proof. Suppose $[A_1, A_2] = 0$ with $c_1 = 0$. Then we have $b_1e_2 - e_1b_2 = b_1c_2 = e_1c_2 = 0$, using (2.1). Since $A_1$ is non-scalar, either $e_1$ or $b_1$ is non-zero. In an integral domain, this implies $b_1e_2 - e_1b_2 = c_2 = 0$. The other statement is similar. □

This implies the following well known result. Note that a 2 x 2 matrix is non-scalar if 
and only if it is nonderogatory. 

Proposition 2.2. Let A be a commutative matrix sequence (i.e, all terms pairwise com- 
mute) of finite or infinite length over an integral domain R. Then A is triangularizable if 
and only if one of its non-scalar terms is triangularizable. Similarly, A is diagonalizable 
if and only if one of its non-scalar terms is diagonalizable. □ 

As a consequence, the proof of Proposition 1.9 is complete after the following. Let us denote by $\mathbb{A}$ the algebra generated over $\mathbb{F}$, the field of fractions of $R$, by the terms of $\mathcal{A}$. A well known result is that $\mathcal{A}$ is commutative if and only if the dimension of $\mathbb{A}$ is $\leq 2$.

Proposition 2.3. Let $n \geq 2$ and $\mathcal{A} \in V_n(R)$ be a non-commutative matrix sequence. Then $\mathcal{A}$ is triangularizable if and only if all subsequences of $\mathcal{A}$ of length $\leq 3$ are triangularizable.

Proof. If $\mathcal{A}$ is triangularizable, it is clear that all subsequences of $\mathcal{A}$ are also triangularizable. Conversely, since $\mathcal{A}$ is non-commutative, we can assume without loss of generality that $[A_1, A_2] \neq 0$, and that, after a suitable conjugation, both $A_1$ and $A_2$ are upper triangular. By hypothesis, all triples of the form $(A_1, A_2, A_k)$, $k = 3, \ldots, n$, are triangularizable. Then, the algebra $\mathbb{A}_k$ generated by $\mathcal{A}_k = (A_1, A_2, A_k)$ equals the one generated by $(A_1, A_2)$, since one is a subset of the other but both are three dimensional over $\mathbb{F}$. Indeed, if $\mathbb{A}_k$ was of dimension $\leq 2$, $\mathcal{A}_k$ would be commutative, and if it was four dimensional, $\mathcal{A}_k$ would not be triangularizable. Therefore, for all $j = 3, \ldots, n$, $A_j = p_j(A_1, A_2)$ for some polynomial $p_j$ with coefficients in $\mathbb{F}$ and thus $A_j$ is also upper triangular, for all $j$. □

Recall also the following. 

Lemma 2.4. Let $\mathcal{A} = (A_1, A_2) \in V_2(\mathbb{F})$ be commutative and $A_1$ be a non-scalar matrix. Then $A_2 = p(A_1)$ for some degree 1 polynomial $p(x) \in \mathbb{F}[x]$.

Proof. From Equation (2.1), the conditions $[A_1, A_2] = 0$ can be written as $Mu = 0$, where

(2.3) $M = \begin{pmatrix} 0 & c_1 & -e_1 \\ -c_1 & 0 & b_1 \\ e_1 & -b_1 & 0 \end{pmatrix}, \qquad u = (b_2, e_2, c_2).$

Note that $M$ has rank exactly 2, since $A_1$ is non-scalar. So, the vector $u = (b_2, e_2, c_2)$ is in the nullspace of $M$, which is generated by $(b_1, e_1, c_1) \neq 0$. Thus, there is an $\alpha \in \mathbb{F}$ such that $(b_2, e_2, c_2) = \alpha(b_1, e_1, c_1)$. Then, $A_2 = (d_2 - \alpha d_1)I + \alpha A_1$ is, explicitly, the required polynomial ($I$ denotes the identity $2 \times 2$ matrix). □
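The proof is constructive, and the coefficient $\alpha$ can be read off from any non-zero coordinate of $(b_1, e_1, c_1)$. A small sketch over the rationals, for integer input matrices, follows; the function name `linear_polynomial` is ours.

```python
from fractions import Fraction

def linear_polynomial(A1, A2):
    """Return (c0, alpha) with A2 = c0*I + alpha*A1, following the proof of Lemma 2.4.

    Raises ValueError if A1 is scalar or if A1 and A2 do not commute.
    """
    (a1, b1), (c1, d1) = A1
    (a2, b2), (c2, d2) = A2
    v1 = (b1, a1 - d1, c1)          # (b_1, e_1, c_1)
    v2 = (b2, a2 - d2, c2)          # (b_2, e_2, c_2)
    pivot = next((i for i in range(3) if v1[i] != 0), None)
    if pivot is None:
        raise ValueError("A1 is scalar")
    alpha = Fraction(v2[pivot], v1[pivot])
    # Commutativity is equivalent to v2 being a multiple of v1.
    if any(v2[i] != alpha * v1[i] for i in range(3)):
        raise ValueError("A1 and A2 do not commute")
    return d2 - alpha * d1, alpha

A1 = [[1, 2], [3, 4]]
A2 = [[7, 10], [15, 22]]            # A1 squared
c0, alpha = linear_polynomial(A1, A2)
```

For this example the result is $A_1^2 = 2I + 5A_1$, which agrees with the Cayley-Hamilton theorem since $\mathrm{tr}\,A_1 = 5$ and $\det A_1 = -2$.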

It is clear that a single matrix is triangularizable over an algebraically closed field, but not necessarily so over a general integral domain or field. In Appendix A we include a short account of the conditions for triangularization of a single matrix in $V_1(R)$. The following notion will play a central role. If $\mathcal{A}$ is a subsequence of $\mathcal{B}$, we will write $\mathcal{A} \subset \mathcal{B}$.

Definition 2.5. A matrix sequence with at least one non-scalar term $\mathcal{A} = (A_1, \ldots) \in V_n(R)$ is called reduced if there are no commuting pairs among its terms, that is, $[A_j, A_k] \neq 0$ for all $1 \leq j < k \leq n$. We say that $\mathcal{A}$ is a reduction of $\mathcal{B}$ if $\mathcal{A}$ is reduced and is obtained from $\mathcal{B}$ by deleting some of its terms. Finally, we say that $\mathcal{A}$ is a maximal reduction of $\mathcal{B}$, and that $l$ is its reduced length, if $\mathcal{A}$ is a reduction of $\mathcal{B}$ of length $l$, and any subsequence $\mathcal{A}' \subset \mathcal{B}$ with length $> l$ is not reduced.

Over a field, by Lemma 2.4, a reduced sequence $\mathcal{A}$ is one where no term is a polynomial function of another (so each term generates an algebra of dimension 2, and no two terms generate the same subalgebra of the full matrix algebra). It is clear that any two maximal reductions have the same length. Note also that any subsequence of a reduced sequence is also reduced. The following facts show that important properties like existence of a triangularization are captured by any maximal reduction of a matrix sequence.
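A maximal reduction can be extracted by a single greedy pass. The sketch below is an illustration under one assumption: it relies on the fact, which follows from Lemma 2.4 over the field of fractions, that commuting is an equivalence relation among non-scalar $2 \times 2$ matrices, so keeping one representative per class yields a reduction of maximal length. Function names are ours and integer entries are assumed.

```python
def mul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def commute(A, B):
    return mul(A, B) == mul(B, A)

def is_scalar(A):
    return A[0][1] == 0 and A[1][0] == 0 and A[0][0] == A[1][1]

def maximal_reduction(seq):
    """Greedy pass: drop scalar terms and any term commuting with one already kept."""
    reduced = []
    for A in seq:
        if is_scalar(A):
            continue
        if all(not commute(A, B) for B in reduced):
            reduced.append(A)
    return reduced

I2 = [[1, 0], [0, 1]]
A = [[1, 2], [3, 4]]
A_sq = [[7, 10], [15, 22]]       # A squared, a polynomial in A
B = [[0, 1], [0, 0]]
B_shift = [[1, 2], [0, 1]]       # 2B + I, a polynomial in B
seq = [I2, A, A_sq, B, B_shift]
```

On this example the pass keeps only `A` and `B`, so the reduced length is 2.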

Proposition 2.6. Let $\mathcal{A} = (A_1, \ldots, A_n) \in V_n(R)$ and let $A_{n+1}$ commute with at least one non-scalar term of $\mathcal{A}$. Then $\mathcal{A}$ is triangularizable if and only if $\mathcal{A}' := (A_1, \ldots, A_n, A_{n+1})$ is triangularizable.

Proof. Naturally if $\mathcal{A}'$ is triangularizable, $\mathcal{A}$ is also. For the converse, without loss of generality assume $[A_1, A_{n+1}] = 0$ with $A_1$ non-scalar and $\mathcal{A}$ in upper triangular form. Then, by Lemma 2.1, $A_{n+1}$ is also upper triangular, so that $\mathcal{A}'$ is triangularizable. □

Corollary 2.7. If $\mathcal{A}$ and $\mathcal{A}'$ are arbitrary sequences with a common maximal reduction (of length $\geq 1$), then either they are both triangularizable or both not triangularizable.

Proof. Let $\mathcal{B}$ be such a common maximal reduction. Then $\mathcal{A}$ and $\mathcal{A}'$ are obtained from $\mathcal{B}$ by adding scalar matrices or matrices that commute with some of the non-scalar terms of $\mathcal{B}$. So, if $\mathcal{B}$ is triangularizable, both $\mathcal{A}$ and $\mathcal{A}'$ are triangularizable by repeatedly applying Proposition 2.6. The case when $\mathcal{B}$ is not triangularizable is analogous. □

2.2. Necessary conditions for triangularization via invariants. We continue to work over an integral domain $R$. Define the following important $GL_2(R)$-invariant functions. For a matrix $A \in V_1$, let $\delta_A$ denote the discriminant of its characteristic polynomial, that is $\delta_A = \mathrm{tr}^2 A - 4\det A$.

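Definition 2.8. Let $\tau, \sigma : V_2(R) \to R$ and $\Delta : V_3(R) \to R$ be defined by

$\tau(A, B) := 2\,\mathrm{tr}(AB) - \mathrm{tr}(A)\,\mathrm{tr}(B),$
$\sigma(A, B) := \det(AB - BA),$
$\Delta(A, B, C) := (\mathrm{tr}(ABC - CBA))^2.$

When a matrix sequence is written as $\mathcal{A} = (A_1, \ldots, A_n) \in V_n(R)$ we will also use

$\tau_{jk} = \tau_{jk}(\mathcal{A}) = \tau(A_j, A_k),$
$\sigma_{jk} = \sigma_{jk}(\mathcal{A}) = \sigma(A_j, A_k),$
$\Delta_{jkl} = \Delta_{jkl}(\mathcal{A}) = \Delta(A_j, A_k, A_l),$

for any indices $j, k, l \in \{1, \ldots, n\}$. By simple computations, we can express these functions in terms of $b_j, c_j, e_j$ as follows.

$\tau_{jk} = e_je_k + 2b_jc_k + 2c_jb_k,$

(2.4) $\sigma_{jk} = (b_je_k - e_jb_k)(c_je_k - e_jc_k) - (b_jc_k - c_jb_k)^2,$

(2.5) $\Delta_{jkl} = \begin{vmatrix} b_j & b_k & b_l \\ e_j & e_k & e_l \\ c_j & c_k & c_l \end{vmatrix}^2.$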



Note that $\tau$, $\sigma$ and $\Delta$ are symmetric under permutation of any matrices/indices, but $\sigma$ and $\Delta$ vanish when 2 matrices/indices coincide. Since the above expressions do not depend explicitly on the variables $a_j$ or $d_j$ but only on the difference $e_j = a_j - d_j$, the functions $\tau$, $\sigma$ and $\Delta$ are invariant under translation of any argument by a scalar matrix, that is, for any matrices $A, B$ and scalar matrices $\lambda, \mu$, we have $\tau(A + \lambda, B + \mu) = \tau(A, B)$ and similarly for $\sigma$ and $\Delta$.
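The expressions of $\tau$, $\sigma$ and $\Delta$ in terms of the entries $b_j, e_j, c_j$ can be compared numerically with their definitions via traces and determinants. The following sketch does so on random integer matrices; the function names and the sampling range are our own choices.

```python
import random

def mul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def sub(A, B):
    return [[A[i][j] - B[i][j] for j in range(2)] for i in range(2)]

def tr(A):
    return A[0][0] + A[1][1]

def det(A):
    return A[0][0] * A[1][1] - A[0][1] * A[1][0]

def bec(A):
    """Coordinates (b, e, c) of A = [[a, b], [c, d]], with e = a - d."""
    return A[0][1], A[0][0] - A[1][1], A[1][0]

def tau_def(A, B):
    return 2 * tr(mul(A, B)) - tr(A) * tr(B)

def sigma_def(A, B):
    return det(sub(mul(A, B), mul(B, A)))

def delta_def(A, B, C):
    return tr(sub(mul(mul(A, B), C), mul(mul(C, B), A))) ** 2

def tau_entries(A, B):
    (bj, ej, cj), (bk, ek, ck) = bec(A), bec(B)
    return ej * ek + 2 * bj * ck + 2 * cj * bk

def sigma_entries(A, B):
    (bj, ej, cj), (bk, ek, ck) = bec(A), bec(B)
    return (bj * ek - ej * bk) * (cj * ek - ej * ck) - (bj * ck - cj * bk) ** 2

def delta_entries(A, B, C):
    (bj, ej, cj), (bk, ek, ck), (bl, el, cl) = bec(A), bec(B), bec(C)
    d = (bj * (ek * cl - el * ck) - bk * (ej * cl - el * cj)
         + bl * (ej * ck - ek * cj))
    return d ** 2

def formulas_agree(trials=200, seed=1):
    """Compare definitions with entry formulas on random integer triples."""
    rng = random.Random(seed)
    for _ in range(trials):
        A, B, C = ([[rng.randint(-9, 9) for _ in range(2)] for _ in range(2)]
                   for _ in range(3))
        if (tau_def(A, B) != tau_entries(A, B)
                or sigma_def(A, B) != sigma_entries(A, B)
                or delta_def(A, B, C) != delta_entries(A, B, C)):
            return False
    return True
```

A random test of this kind is of course no proof, but it is a cheap guard against sign or index slips when the formulas are transcribed.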

Remark 2.9. Note that these are essentially the same functions used in Drensky's Theorem 1.8 [D]. They also coincide with the functions used in [Fl], up to a constant factor. There are some interesting relations between these invariants which are obtained from simple calculations. In particular, we have

$\tau(A, A) = \delta_A = \mathrm{tr}^2 A - 4\det A,$

(2.6) $\sigma(A, B) = \mathrm{tr}(A[A, B]B) = \frac{1}{4}\left(\tau(A, A)\,\tau(B, B) - \tau(A, B)^2\right),$

$\Delta(A, B, C) = -\frac{1}{4}\begin{vmatrix} \tau(A, A) & \tau(A, B) & \tau(A, C) \\ \tau(B, A) & \tau(B, B) & \tau(B, C) \\ \tau(C, A) & \tau(C, B) & \tau(C, C) \end{vmatrix},$

for all matrices $A, B, C$ over $R$, in agreement with Equation (1.4).
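The expression of $\sigma$ through $\tau$ can be verified by passing to traceless parts. The derivation below is a sketch under the extra assumption (ours) that 2 is invertible, for example after passing to the field of fractions in characteristic $\neq 2$; the symbols $X$, $Y$ are introduced only for this computation.

```latex
\begin{align*}
X &:= A - \tfrac{1}{2}\mathrm{tr}(A)\,I, \quad
Y := B - \tfrac{1}{2}\mathrm{tr}(B)\,I, \quad
[A,B] = [X,Y], \quad \tau(A,B) = 2\,\mathrm{tr}(XY), \\
X^2 &= \tfrac{1}{2}\mathrm{tr}(X^2)\,I, \qquad
XY + YX = \mathrm{tr}(XY)\,I
\qquad \text{(Cayley-Hamilton for traceless $2 \times 2$ matrices)}, \\
\sigma(A,B) &= \det[X,Y] = -\tfrac{1}{2}\,\mathrm{tr}\big([X,Y]^2\big)
 = \mathrm{tr}(X^2Y^2) - \mathrm{tr}(XYXY), \\
\mathrm{tr}(XYXY) &= \mathrm{tr}\big(XY\,(\mathrm{tr}(XY)\,I - YX)\big)
 = \mathrm{tr}(XY)^2 - \mathrm{tr}(X^2Y^2), \\
\sigma(A,B) &= 2\,\mathrm{tr}(X^2Y^2) - \mathrm{tr}(XY)^2
 = \mathrm{tr}(X^2)\,\mathrm{tr}(Y^2) - \mathrm{tr}(XY)^2
 = \tfrac{1}{4}\big(\tau(A,A)\,\tau(B,B) - \tau(A,B)^2\big).
\end{align*}
```

The same device, applied to the symmetric bilinear form $(X, Y) \mapsto 2\,\mathrm{tr}(XY)$ on traceless matrices, expresses $\Delta$ as a multiple of a Gram determinant.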



The following is a simple necessary condition for triangularization. 

Proposition 2.10. Let $\mathcal{A} \in V_n$ be a triangularizable sequence. Then $\sigma(A, B)$ and $\Delta(A, B, C)$ vanish for all terms $A, B, C$ of $\mathcal{A}$.

Proof. Since $\sigma$ and $\Delta$ are $G$ invariant, we can assume that $\mathcal{A}$ is upper triangular. By direct computation, $\sigma(A, B) = \det([A, B]) = 0$, and $\Delta(A, B, C) = (\mathrm{tr}(ABC - CBA))^2 = 0$. □

Note that the vanishing of all $\sigma_{jk} = \sigma(A_j, A_k)$ is not sufficient for $\mathcal{A}$ to be triangularizable, as the following important example shows. We adopt the usual convention that blank matrix entries stand for zero entries (in this case $0 \in R$).

Example 2.11. Let $\mathcal{A} = (A_1, A_2, A_3) \in V_3$ have the form

$A_1 = \begin{pmatrix} a_1 & \\ & d_1 \end{pmatrix}, \qquad A_2 = \begin{pmatrix} a_2 & b_2 \\ & d_2 \end{pmatrix}, \qquad A_3 = \begin{pmatrix} a_3 & \\ c_3 & d_3 \end{pmatrix},$

for some $a_1, \ldots, d_3 \in R$. Then $\sigma_{12} = \sigma_{13} = 0$ and $\sigma_{23} = -b_2c_3(e_2e_3 + b_2c_3)$. Assume that $e_2e_3 + b_2c_3 = 0$ and that $e_1b_2c_3 \neq 0$, so that all $\sigma_{jk}$ vanish, neither $A_2$ nor $A_3$ is diagonal, and (since these assumptions imply $e_2e_3 \neq 0$) all three matrices have distinct eigenvalues. So, in this case, all subsequences of length $\leq 2$ are triangularizable, but the next Proposition will show that $\mathcal{A}$ is not triangularizable.

Proposition 2.12. As in Example 2.11, let $\mathcal{A} = (A_1, A_2, A_3) \in V_3$ be a triple of the form

$A_1 = \begin{pmatrix} a_1 & \\ & d_1 \end{pmatrix}, \qquad A_2 = \begin{pmatrix} a_2 & b_2 \\ & d_2 \end{pmatrix}, \qquad A_3 = \begin{pmatrix} a_3 & \\ c_3 & d_3 \end{pmatrix},$

with $e_2e_3 \neq 0$. Then, the following are equivalent.

(i) $\mathcal{A}$ is reduced; (ii) $\mathcal{A}$ is not triangularizable; (iii) $\Delta_{123} \neq 0$ (i.e., $e_1b_2c_3 \neq 0$).

Proof. (i) or (ii) imply (iii): If $e_1b_2c_3 = 0$, at least one of the factors is zero. In each case, $A_1$ is a scalar, $\mathcal{A}$ is lower triangular, or $\mathcal{A}$ is upper triangular, respectively, so $\mathcal{A}$ is triangularizable and is not reduced, since $A_1$ commutes with one or both of the other terms. (iii) implies (ii) and (i): Suppose that $e_1b_2c_3 \neq 0$. Then, none of the three numbers is zero. Let $g$ be the $SL_2(R)$ matrix with columns $(x, y)$ and $(z, w)$. Then

(2.8)
$gA_1g^{-1} = \begin{pmatrix} * & -xz\,e_1 \\ yw\,e_1 & * \end{pmatrix},$
$gA_2g^{-1} = \begin{pmatrix} * & x(xb_2 - ze_2) \\ y(we_2 - yb_2) & * \end{pmatrix},$
$gA_3g^{-1} = \begin{pmatrix} * & -z(xe_3 + zc_3) \\ w(ye_3 + wc_3) & * \end{pmatrix},$

from which it follows that there is no $g \in G$ that will make $g \cdot \mathcal{A}$ upper or lower triangular, so $\mathcal{A}$ is not triangularizable. Also, by Lemma 2.1, none of the 3 commutators between the pairs will vanish, so $\mathcal{A}$ is reduced. □
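A concrete integer instance of this triple makes the phenomenon visible. In the sketch below the specific matrices, the function names and the search bound are our own choices: all three $\sigma_{jk}$ vanish, $\Delta_{123}$ does not, and a finite search over $SL_2(\mathbb{Z})$ finds no $g$ making the triple upper triangular. The search is an illustration only; the proof above is what rules out every $g$.

```python
def mul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def sub(A, B):
    return [[A[i][j] - B[i][j] for j in range(2)] for i in range(2)]

def sigma(A, B):
    C = sub(mul(A, B), mul(B, A))
    return C[0][0] * C[1][1] - C[0][1] * C[1][0]

def Delta(A, B, C):
    D = sub(mul(mul(A, B), C), mul(mul(C, B), A))
    return (D[0][0] + D[1][1]) ** 2

A1 = [[1, 0], [0, 0]]     # e1 = 1
A2 = [[1, 1], [0, 0]]     # e2 = 1, b2 = 1
A3 = [[1, 0], [-1, 0]]    # e3 = 1, c3 = -1, so e2*e3 + b2*c3 = 0

def lower_left_entries(x, y, z, w):
    """(2,1) entries of g A_j g^{-1} for g = [[x, z], [y, w]], assuming det g = 1."""
    g, g_inv = [[x, z], [y, w]], [[w, -z], [-y, x]]
    return [mul(mul(g, A), g_inv)[1][0] for A in (A1, A2, A3)]

def some_g_triangularizes(bound=3):
    """Search SL_2(Z) with entries in [-bound, bound] for a simultaneous
    upper triangularization of (A1, A2, A3)."""
    rng = range(-bound, bound + 1)
    return any(x * w - z * y == 1 and lower_left_entries(x, y, z, w) == [0, 0, 0]
               for x in rng for y in rng for z in rng for w in rng)
```

For $g$ with columns $(1, 1)$ and $(0, 1)$, the three lower-left entries are $yw\,e_1 = 1$, $y(we_2 - yb_2) = 0$ and $w(ye_3 + wc_3) = 0$, matching the formulas in (2.8).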

2.3. Numerical criteria for triangularization. A simple necessary and sufficient numerical condition for triangularization of a pair of $2 \times 2$ matrices over an algebraically closed field was given in the article [Fr], which also describes the similarity classes of pairs of $m \times m$ matrices in great generality.

Proposition 2.13. [Fr] A pair $(A_1, A_2) \in V_2(\bar{\mathbb{F}})$ is triangularizable if and only if $\sigma_{12} = \det[A_1, A_2] = 0$.

Note that Friedland writes the condition $\sigma_{12} = 0$ in a different, but equivalent form (see Equation (2.6) in Remark 2.9). The generalization to integral domains is as follows.

Theorem 2.14. A pair $\mathcal{A} = (A_1, A_2) \in V_2(R)$ is triangularizable if and only if both $A_1$ and $A_2$ are triangularizable and $\sigma_{12} = \det[A_1, A_2] = 0$.

Proof. The conditions are clearly necessary. For the converse, let us suppose that both $A_1$ and $A_2$ are triangularizable and $\det[A_1, A_2] = 0$. Then, as the determinant is $GL_2(R)$ invariant, we can assume $A_1$ upper triangular ($c_1 = 0$). If $[A_1, A_2] = 0$ the pair $(A_1, A_2)$ is triangularizable by Proposition 2.2, so we can assume that $[A_1, A_2] \neq 0$. Equation (2.1) shows that

(2.9) $0 = \det[A_1, A_2] = -b_1^2c_2^2 + e_1c_2(e_1b_2 - b_1e_2) = -c_2(b_1^2c_2 + b_1e_1e_2 - e_1^2b_2).$

If $c_2 = 0$, $\mathcal{A}$ is triangularizable. If not, $c_2 \neq 0$ and we distinguish 4 cases. (i) If $e_1 = 0$, then $b_1 \neq 0$ (as $A_1$ is non-scalar) which makes equation (2.9) impossible to solve. (ii) If $b_1 = 0$, then $e_1 \neq 0$, and equation (2.9) implies $b_2 = 0$ and $\mathcal{A}$ is lower triangular. (iii) Suppose now $b_1e_2 = e_1b_2$. Then $0 = \det[A_1, A_2] = -c_2^2b_1^2$ and so $b_1 = 0$, which reduces to the previous case.

Finally, consider the general case (iv) with $\delta_{12} = b_1e_2 - e_1b_2 \neq 0$ and non-zero $b_1$ and $e_1$. So, we are assuming $c_2b_1e_1 \neq 0$. From equation (2.9), the quadratic equation $Q_2(x, y) = c_2x^2 - e_2xy - b_2y^2 = 0$ associated to $A_2$ (see Appendix A) has a non-trivial solution: $(b_1, -e_1) \in R^2$. Suppose that $A_2$ is nondegenerate ($\delta_{A_2} \neq 0$). Then, by Lemma A.2, there are $z', w'$ in $\mathbb{F}$, the field of fractions of $R$, such that $w'b_1 + z'e_1 \neq 0$ and the eigenvectors of $A_2$ are multiples of $(b_1, -e_1)$ and of $(z', w')$. So, we choose an eigenvector of $A_2$ of the form $(z'', w'') \in R^2$ colinear with $(z', w') \in \mathbb{F}^2$. Moreover, by Proposition A.1, there are $\alpha, \beta \in \mathbb{F}^* = \mathbb{F} \setminus \{0\}$ so that the eigenvectors $(x, y) = \alpha(b_1, -e_1)$ and $(z, w) = \beta(z'', w'')$ verify either $xR + yR = R$ or $zR + wR = R$. If the first alternative holds, let $xq - yp = 1$ for some $p, q \in R$ (note that $(p, q)$ is not necessarily an eigenvector of $A_2$) and put:

$g := \begin{pmatrix} q & -p \\ -y & x \end{pmatrix}, \qquad g^{-1} = \begin{pmatrix} x & p \\ y & q \end{pmatrix}.$

Then, conjugating by $g$ gives upper triangular matrices $gA_1g^{-1}$ and $gA_2g^{-1}$, because $(x, y)$ is an eigenvector of both $A_1$ and $A_2$, so $\mathcal{A}$ is again triangularizable. If the second alternative holds, we do the same interchanging the roles of $(x, y)$ and $(z, w)$. Finally, suppose that $A_2$ is degenerate ($\delta_{A_2} = 0$). Then, there is only one eigenvector of $A_2$ and the solutions to $Q_2(x, y) = 0$ form a single line through the origin in $\mathbb{F}^2$, so all solutions $(x, y) \in R^2$ are multiples of $(b_1, -e_1) \in R^2$. Since $A_2$ is triangularizable, by Proposition A.1 we can choose $(x, y)$ so that $xR + yR = R$ and we proceed as before. □
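Over $R = \mathbb{Z}$ the construction in case (iv) can be carried out with the extended Euclidean algorithm. The following is a sketch under stated assumptions: $A_1$ is an integer, upper triangular, non-scalar matrix and $\det[A_1, A_2] = 0$ with $c_2 \neq 0$. The function names are ours, and the choice of $\mathbb{Z}$ (a principal ideal domain, where a primitive multiple of $(b_1, -e_1)$ always exists) is a simplification of the general setting.

```python
from math import gcd

def mul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def ext_gcd(a, b):
    """Return (g, s, t) with s*a + t*b == g, where g is gcd(a, b) up to sign."""
    if b == 0:
        return a, 1, 0
    g, s, t = ext_gcd(b, a % b)
    return g, t, s - (a // b) * t

def triangularizing_matrix(A1):
    """Return (g, g_inv) in SL_2(Z) such that the first column of g_inv is the
    primitive multiple (x, y) of the common eigenvector (b1, -e1)."""
    b1, e1 = A1[0][1], A1[0][0] - A1[1][1]
    d = gcd(b1, e1)                  # non-zero because A1 is non-scalar
    x, y = b1 // d, (-e1) // d
    g0, s, t = ext_gcd(x, y)         # s*x + t*y = +-1
    if g0 < 0:
        s, t = -s, -t
    q, p = s, -t                     # now x*q - y*p = 1
    return [[q, -p], [-y, x]], [[x, p], [y, q]]

A1 = [[1, 1], [0, 0]]                # b1 = e1 = 1, c1 = 0
A2 = [[1, 2], [1, 0]]                # b1^2 c2 + b1 e1 e2 - e1^2 b2 = 1 + 1 - 2 = 0
g, g_inv = triangularizing_matrix(A1)
B1 = mul(mul(g, A1), g_inv)
B2 = mul(mul(g, A2), g_inv)
```

In this example $(b_1, -e_1) = (1, -1)$ is an eigenvector of $A_1$ (eigenvalue 0) and of $A_2$ (eigenvalue $-1$), and both conjugated matrices have vanishing lower-left entry.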

Example 2.15. Over the integral domain R = C[u,v], consider the pair 



Then we have $A_1$ upper triangular and $\sigma_{12} = 0$ as can be checked, but $A_2$ is not triangularizable over $\mathbb{C}[u, v]$, as no eigenvector $(x, y) \in R^2$ satisfies $xR + yR = R$. So, $\sigma_{12} = 0$ and $A_1$ is triangularizable but not the pair $(A_1, A_2)$.

We finally arrive at Theorem 1.11, which is a converse to Proposition 2.10.

Theorem 2.16. A sequence $\mathcal{A} = (A_1, \ldots, A_n) \in V_n(R)$ is triangularizable if and only if each $A_j$ is triangularizable and $\sigma_{jk} = \Delta_{jkl} = 0$ for all $1 \leq j, k, l \leq n$.

Proof. Consider A = {Ai, An) reduced with ajk = ^jki = for all I < j,k,l < n. By 
Theorem 12. 141 the conditions ajk = and Aj triangularizable mean that all subsequences 
of A of length < 2 are triangularizable. So, after a similarity that puts Ai and A2 in upper 
triangular form, we can assume (^1, ^2, A3) to be in the form 



Since A is reduced, by hypothesis 5i2 = 6162 — 6162 7^ 0. From Equation ()2.5p . we see that 
= A123 = (6162 — ei62)^C3, so C3 = which means that {Ai, A2, A3) is triangularizable. 
Repeating the argument for all triples {Aj, A^, Ai) we see that all subsequences of A of 
length < 3 are triangularizable. So A is triangularizable by Proposition 11.91 Finally, if A 
is not reduced, we consider the above argument for any maximal reduction B, triangularize 
B, and apply Corollarv 12.71 □ 




Then, conjugating by g gives

2.4. Improved criterion for triangularization. We now prove an inductive property
of reduced sequences that allows us to improve the criterion of Theorem 2.16.

Theorem 2.17. Let $n \ge 4$ and let $\mathcal{A} = (A_1, ..., A_{n-1}) \in V_{n-1}$ be a reduced triangularizable
sequence. If $\sigma(A_j, A_n) = 0$ for some matrix $A_n$ and all $j = 1, ..., n-1$, then $(A_1, ..., A_n)$
is also triangularizable.

Proof. We can suppose that $(A_1, ..., A_{n-1})$ has been conjugated so that it is already an
upper triangular matrix sequence. So all the $\sigma_{jk}$ vanish, for indices $j, k$ between 1 and
$n-1$ (by Proposition 2.10). To reach a contradiction, assume that $\mathcal{A}' = (A_1, ..., A_n)$ is
not triangularizable, so that

\[ A_n = \begin{pmatrix} a_n & b_n \\ c_n & d_n \end{pmatrix} \]

is non-scalar with $c_n \ne 0$. Since $\mathcal{A}$ is reduced, none of the $A_j$ can be scalar. We can also
assume that $A_n$ does not commute with any $A_j$, otherwise $\mathcal{A}'$ would be triangularizable
by Proposition 2.6. This means that $\delta_{jn} := b_j e_n - e_j b_n \ne 0$, for $j = 1, ..., n-1$, by Lemma
2.1. Using Formula (2.4), the $n-1$ conditions $\sigma_{jn} = 0$, $j = 1, ..., n-1$, can be written as
(because $c_n \ne 0$)

\[ b_j^2 c_n + b_j e_j e_n - e_j^2 b_n = 0, \quad \text{for } j = 1, ..., n-1. \]

Since $A_n$ is non-scalar, we are looking for a non-zero solution $u = (b_n, e_n, c_n) \in R^3$ to the
matrix equation $Bu = 0$, where

\[ B = \begin{pmatrix} -e_1^2 & e_1 b_1 & b_1^2 \\ \vdots & \vdots & \vdots \\ -e_{n-1}^2 & e_{n-1} b_{n-1} & b_{n-1}^2 \end{pmatrix}. \]

A simple computation shows that every $3 \times 3$ minor of $B$ is of the form $\pm \delta_{jk} \delta_{kl} \delta_{lj}$. Since
all these minors are non-zero by hypothesis, there is no non-zero solution $u \in R^3$, and we
have a contradiction. Hence $c_n = 0$ and $\mathcal{A}'$ is triangularizable. □
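
The minor computation used in the proof can be spot-checked numerically. The following is a minimal sketch, assuming that the rows of $B$ are $(-e_j^2, e_j b_j, b_j^2)$ as in the proof and that $\delta_{jk} = b_j e_k - e_j b_k$; the helper names (`det3`, `row`, `minor_and_product`) are illustrative and not from the paper. With these conventions the minor on rows $j < k < l$ equals $\delta_{jk}\delta_{jl}\delta_{kl}$, which is $-\delta_{jk}\delta_{kl}\delta_{lj}$.

```python
def det3(m):
    """Determinant of a 3x3 matrix given as three row tuples."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)


def delta(p, q):
    """delta_jk = b_j e_k - e_j b_k for p = (b_j, e_j) and q = (b_k, e_k)."""
    return p[0] * q[1] - p[1] * q[0]


def row(p):
    """Row of B attached to the pair (b_j, e_j)."""
    b, e = p
    return (-e * e, e * b, b * b)


def minor_and_product(p1, p2, p3):
    """Return the 3x3 minor of B and the product delta_12 * delta_13 * delta_23."""
    minor = det3([row(p1), row(p2), row(p3)])
    product = delta(p1, p2) * delta(p1, p3) * delta(p2, p3)
    return minor, product


# Each sample is a triple of pairs (b_j, e_j); the last one has two colinear pairs.
samples = [((1, 0), (0, 1), (1, 1)),
           ((2, 3), (5, -1), (4, 7)),
           ((1, 2), (2, 4), (3, 1))]
for triple in samples:
    print(triple, minor_and_product(*triple))
```

In particular, a minor vanishes exactly when two of the pairs $(b_j, e_j)$ are colinear, which is the case excluded by the hypothesis $\delta_{jk} \ne 0$.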

From two finite matrix sequences $\mathcal{A} = (A_1, ..., A_{n_1})$ and $\mathcal{B} = (B_1, ..., B_{n_2})$ one can form
their concatenation $\mathcal{A} \cup \mathcal{B} := (A_1, ..., A_{n_1}, B_1, ..., B_{n_2})$. The following corollary may be
called the concatenation principle for triangularizable sequences.

Corollary 2.18. If $\mathcal{A} \in V_{n_1}$ and $\mathcal{B} \in V_{n_2}$ are triangularizable matrix sequences and they
have a common reduction of length $\ge 3$, then their concatenation $\mathcal{A} \cup \mathcal{B} \in V_{n_1 + n_2}$ is also
triangularizable.

Proof. Let $\mathcal{C}$ be such a common reduction, which we can assume to have length 3. Then $\mathcal{C} =
(C_1, C_2, C_3)$ is obviously triangularizable and $\sigma(C_j, A_k) = \sigma(C_j, B_k) = 0$ for all possible
indices, since both $\mathcal{A}$ and $\mathcal{B}$ are triangularizable. So Theorem 2.17 applies. □

To prove Theorems 1.12 and 1.13 we will need the following easy fact.

Lemma 2.19. Let $n \ge 2$. A reduced triangularizable sequence of length $n$ has at least $n-1$
diagonalizable terms.

Proof. We can assume that $\mathcal{A}$ is already in upper triangular form. Let $A_1$ and $A_2$ be
two non-diagonalizable terms of $\mathcal{A}$. Then, they are of the form

\[ A_1 = \begin{pmatrix} a_1 & b_1 \\ 0 & a_1 \end{pmatrix}, \quad A_2 = \begin{pmatrix} a_2 & b_2 \\ 0 & a_2 \end{pmatrix} \]

for some $a_1, a_2, b_1, b_2 \in R$. Thus, they commute, so $\mathcal{A}$ is not reduced. □

Proposition 2.20. Let $\mathcal{A} = (A_1, A_2, A_3, A_4)$ be a reduced quadruple whose terms $A_j$
are all triangularizable. Then $\mathcal{A}$ is triangularizable if and only if $\sigma_{jk} = 0$ for all $j, k \in
\{1, 2, 3, 4\}$.

Proof. One direction is a consequence of Proposition 2.10. Conversely, let $\mathcal{A}$ be reduced
with $\sigma_{jk} = 0$. By Theorem 2.14 we may assume that $A_1$ and $A_2$ are upper triangular,
so that $c_1 = c_2 = 0$. From Lemma 2.19 we can also assume that $A_1$ is diagonalizable,
so that $b_1 = 0$ and $e_1 \ne 0$. Then, since $\mathcal{A}$ is reduced, $b_2 \ne 0$. In order to obtain a
contradiction, suppose that $c_3 c_4 \ne 0$. Then we have

\[ 0 = \sigma_{13} = e_1^2 b_3 c_3 \]
\[ 0 = \sigma_{14} = e_1^2 b_4 c_4 \]
\[ 0 = \sigma_{23} = -b_2 c_3 (b_2 c_3 + e_2 e_3) \]
\[ 0 = \sigma_{24} = -c_4 (b_2^2 c_4 + b_2 e_2 e_4 - e_2^2 b_4) \]
\[ 0 = \sigma_{34} = -b_4 (c_3^2 b_4 + c_3 e_3 e_4 - e_3^2 c_4) \]

This implies $b_3 = 0$, $b_4 = 0$, $b_2 c_3 + e_2 e_3 = 0$ and $b_2 c_4 + e_2 e_4 = 0$. The last two equations
imply that $(b_2, e_2)$ is a nontrivial solution of the matrix equation

\[ \begin{pmatrix} c_3 & e_3 \\ c_4 & e_4 \end{pmatrix} \begin{pmatrix} b_2 \\ e_2 \end{pmatrix} = \begin{pmatrix} 0 \\ 0 \end{pmatrix}. \]

This implies $c_3 e_4 - e_3 c_4 = 0$, which together with $b_3 = b_4 = 0$ contradicts the fact that
$\mathcal{A}$ is reduced. So, $c_3 c_4 = 0$ and either $\Delta_{123} = 0$ or $\Delta_{124} = 0$, by Equation (2.5). Assuming, without
loss of generality, that $\Delta_{123} = 0$, $(A_1, A_2, A_3)$ is triangularizable by Theorem 2.16 and
the result follows from Proposition 2.12. □

Now, we are ready to prove Theorems 1.12 and 1.13.

Proof of Theorem 1.12. Suppose $\mathcal{A}$ is reduced of length $l \ge 4$ and assume all subsequences
of $\mathcal{A}$ of length $\le 2$ are triangularizable. Then, by Proposition 2.20, all quadruples of $\mathcal{A}$ are
triangularizable, so that $\mathcal{A}$ is triangularizable by Proposition 1.9. □

Proof of Theorem 1.13. Let $\mathcal{A} = (A_1, ..., A_n)$ be a matrix sequence with maximal reduction
$\mathcal{A}' = (A_1, ..., A_l)$ of length $l$. Then $\mathcal{A}$ is triangularizable if and only if $\mathcal{A}'$ is, by Corollary
2.7, showing (i) for $l \le 3$. If $l = 4$, $(A_1, A_2, A_3, A_4)$ is triangularizable by Proposition 2.20.
Then, we are in the hypothesis of Theorem 2.17, which shows that we can apply induction
to conclude that $\mathcal{A}'$ is triangularizable for all $l \ge 4$. □

Remark 2.21. To summarize and relate our results with McCoy's Theorem, we have shown
that over an algebraically closed field and under the simple condition that $\mathcal{A}$ has reduced
length $\ne 3$, $\mathcal{A}$ is triangularizable if and only if the commutators $[A_j, A_k]$ are nilpotent, for
$j = 1, 2, 3$ and $k > j$. In particular, the monomials $p(x)$ in Theorem 1.3 are unnecessary
(for reduced length $\ne 3$).

3. Simultaneous Similarity 

3.1. Similarity and subtriples. Again, let $G = GL_2(R)$. We now prove Proposition
1.10 with the help of the following easy lemma.

Lemma 3.1. Let $[A_1, A_2] \ne 0$ and $g \in G$. If $g \cdot (A_1, A_2) = (A_1, A_2)$ then $g$ is a scalar.

Proof. Let $g$ have columns $(x, y)$ and $(z, w)$ with $xw - yz$ invertible. The conditions
$A_j g = g A_j$, for $j = 1, 2$, can be written as $M_j u = 0$, where

\[ M_j = \begin{pmatrix} 0 & c_j & -e_j \\ -c_j & 0 & b_j \\ e_j & -b_j & 0 \end{pmatrix}, \quad u = (z, x - w, y), \quad j = 1, 2. \]

So, the vector $(z, x - w, y) \in R^3$ lies in the intersection of the nullspaces of the $M_j$, each
being generated by the non-zero vector $(b_j, e_j, c_j)$, $j = 1, 2$ (each $M_j$ has rank exactly 2,
as $A_1$ and $A_2$ are non-scalars). So, $(z, x - w, y)$ is zero unless $(b_1, e_1, c_1)$ and $(b_2, e_2, c_2)$ are
colinear. But this is not the case by Equation (2.1) since $[A_1, A_2] \ne 0$. Thus, $z = y = 0$
and $x = w$, so that $g$ is scalar as wanted. □
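
The rewriting of $A g = g A$ as $M u = 0$ can be checked on a numerical example. This is only an illustrative sketch: the function names are hypothetical, and it assumes the conventions of the proof ($g$ has columns $(x, y)$ and $(z, w)$, $e = a - d$, $u = (z, x - w, y)$). The three rows of $M u$ reproduce the lower left, upper left and upper right entries of $A g - g A$, in that order.

```python
def matmul(p, q):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(p[i][k] * q[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def commutation_defect(A, g):
    """Return the matrix A g - g A."""
    Ag, gA = matmul(A, g), matmul(g, A)
    return [[Ag[i][j] - gA[i][j] for j in range(2)] for i in range(2)]


def M_matrix(A):
    """The 3x3 matrix M attached to A = [[a, b], [c, d]], with e = a - d."""
    (a, b), (c, d) = A
    e = a - d
    return [[0, c, -e], [-c, 0, b], [e, -b, 0]]


def M_times_u(A, g):
    """Compute M u for u = (z, x - w, y), where g = [[x, z], [y, w]]."""
    (x, z), (y, w) = g
    u = [z, x - w, y]
    return [sum(m * t for m, t in zip(r, u)) for r in M_matrix(A)]


A = [[2, 3], [5, -1]]
g = [[1, 4], [2, 7]]
D = commutation_defect(A, g)
print(D)                 # the commutation defect A g - g A
print(M_times_u(A, g))   # its entries (2,1), (1,1), (1,2), in this order
```

The nullspace claim can be checked in the same way: applying `M_matrix(A)` to the vector $(b, e, c)$ gives zero.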
Proof of Proposition 1.10. If $\mathcal{A}$ and $\mathcal{B}$ are similar, then $\mathcal{A}_J \sim \mathcal{B}_J$ for any index set
$J = (j_1, ..., j_l)$ with $l \le n$ and $1 \le j_1 < \cdots < j_l \le n$. Conversely, let $\mathcal{A}_J \sim \mathcal{B}_J$ for index
sets $J$ of cardinality $\le 3$. We divide the proof into three cases.

(i) Suppose $\mathcal{A}$ is scalar. Since scalar matrices are invariant under conjugation, $A_j \sim B_j$
implies that $A_j = B_j$, so $\mathcal{A} = \mathcal{B}$.

(ii) Suppose now that $\mathcal{A}$ is non-scalar, but commutative. Then some term of
$\mathcal{A}$, say $A_1$, is non-scalar. Since $A_1 \sim B_1$, $B_1$ is also non-scalar, and after a conjugation,
we can assume that $B_1 = A_1$. Moreover, $(A_j, A_k) \sim (B_j, B_k)$ and $[A_j, A_k] = 0$ imply
that $[B_j, B_k] = 0$. As a consequence, $\mathcal{B}$ is also commutative and non-scalar. Since $A_1$ is
a non-scalar $2 \times 2$ matrix, it is a nonderogatory matrix, so that every matrix commuting
with $A_1$ is a polynomial in $A_1$ with coefficients in $\mathbb{F}$, the field of fractions of $R$. Therefore
$\mathcal{A} = (A_1, p_2(A_1), ..., p_n(A_1))$ for some polynomials $p_j(x)$, $j = 2, ..., n$. Since pairs are
similar, let $g_j \in G$, $j = 2, ..., n$, be such that $(B_1, B_j) = (A_1, B_j) = (g_j A_1 g_j^{-1}, g_j A_j g_j^{-1})$.
Then $B_j = g_j A_j g_j^{-1} = g_j p_j(A_1) g_j^{-1} = p_j(g_j A_1 g_j^{-1}) = p_j(A_1)$, for all $j = 2, ..., n$. So
$\mathcal{B} = (B_1, ..., B_n) = (A_1, p_2(A_1), ..., p_n(A_1)) = \mathcal{A}$.

(iii) Finally, let $\mathcal{A}$ be non-commutative.
Then, some pair does not commute, say $[A_1, A_2] \ne 0$. Since all pairs are similar we
may, after a suitable conjugation, assume that $(A_1, A_2) = (B_1, B_2)$. Since all triples are
similar, let $g_j \in G$, $j = 3, ..., n$, be such that $g_j \cdot (A_1, A_2, A_j) = (A_1, A_2, B_j)$. Then,
since $g_j \cdot (A_1, A_2) = (A_1, A_2)$, Lemma 3.1 implies that $g_j$ is scalar for all $j = 3, ..., n$, so
$B_j = g_j \cdot A_j = A_j$ and $\mathcal{B} = \mathcal{A}$. □

3.2. Similarity for semisimple sequences. We now work over a field F. Recall the 
following definition. 

Definition 3.2. A sequence $\mathcal{A} \in V_n$ is called semisimple if its $G$-orbit (for the conjugation action (1.1)) is
Zariski closed in $V_n$.

This notion of semisimplicity generalizes that of a single matrix. In the context of
geometric invariant theory, semisimplicity can be translated into more algebraic terms as
follows. Recall that, in the general situation of an affine (algebraic) reductive group
$K$ acting on an affine variety $V$ (both over $\mathbb{F}$), one defines the affine quotient variety $V /\!/ K$
as the maximal spectrum of the ring of invariant functions on $V$, $\mathrm{Spm}(\mathbb{F}[V]^K)$, which
comes equipped with a projection

\[ q : V \to V /\!/ K \]

induced from the canonical inclusion of algebras $\mathbb{F}[V]^K \subset \mathbb{F}[V]$ (see, for example, [MFK]
or [Mu]). The set of closed orbits is in bijective correspondence with geometric points of
the quotient $V /\!/ K$. Recall also that a vector $x \in V$ is said to be stable if the corresponding
'orbit map'

\[ \psi_x : K \to V, \quad g \mapsto g \cdot x \]

is proper. It is easy to see that $x \in V$ is stable if and only if the $K$-orbit of $x$ is closed
and its stabilizer is finite. Another useful criterion for stability is the Hilbert-Mumford
numerical criterion, which is stated in terms of nontrivial homomorphisms $\phi : \mathbb{F}^* \to K$,
called one parameter subgroups (1PS) of $K$ ($\mathbb{F}^* := \mathbb{F} \setminus \{0\}$). To any such $\phi$ and to a point
$x \in V$ one associates the composition $\phi_x := \psi_x \circ \phi : \mathbb{F}^* \to V$. If $\phi_x$ can be extended to a
morphism $\bar{\phi}_x : \mathbb{F} \to V$, we say that $\lim_{\lambda \to 0} \phi_x(\lambda)$ exists and equals $\bar{\phi}_x(0)$.

Theorem 3.3. (Hilbert-Mumford, see [MFK]) A point $x \in V$ is not stable if and only if
there is a one parameter subgroup $\phi$ of $K$ such that $\lim_{\lambda \to 0} \phi_x(\lambda)$ exists.

Let us return to our example of the conjugation action (1.1) of $G = GL_2(\mathbb{F})$ on $V_n =
V_n(\mathbb{F})$, and let $\mathcal{S} \subset V_n = V_n(\mathbb{F})$ denote the subset of semisimple sequences. Then $\mathcal{S}/G$,
being the set of closed orbits, is in bijection with $V_n /\!/ G = \mathrm{Spm}(\mathbb{F}[V_n]^G)$. As described in
the introduction, Drensky's theorem realizes the algebra of invariants as a
quotient $\mathbb{F}[V_n]^G = \mathbb{F}[\mathbf{x}]/I$, where

\[ \mathbf{x} = (t_1, ..., t_n, t_{11}, ..., t_{nn}, u_{12}, ..., u_{n-1,n}, s_{123}, ..., s_{n-2,n-1,n}) \]

is the list of generators ($N = 2n + \binom{n}{2} + \binom{n}{3}$) and $I$ is the ideal of relations. Dualizing the
sequence

\[ \mathbb{F}[\mathbf{x}] \to \mathbb{F}[\mathbf{x}]/I = \mathbb{F}[V_n]^G \subset \mathbb{F}[V_n] \]

we obtain:

\[ V_n \to V_n /\!/ G = \mathcal{S}/G \subset \mathbb{F}^N. \]

By standard arguments, the last inclusion is precisely the map that sends a
$G$-orbit to its values on the generators $\mathbf{x}$, so we conclude the following.

Proposition 3.4. For a field of characteristic zero, this map $\mathcal{S}/G \to \mathbb{F}^N$ is injective.

To obtain an analogous map for non-semisimple sequences, we first characterize these
as triangularizable but not diagonalizable sequences.

Note that, for the action of $G$ on $V_n$ there are no stable points, since the nonzero scalar
matrices stabilize any sequence $\mathcal{A} \in V_n$. This is not a problem, since the same orbit
space is obtained with the conjugation action of the affine reductive group $G_1 = SL_2(\mathbb{F})$
(determinant one matrices in $GL_2(\mathbb{F})$) on $V_n$, which has generically $\mathbb{Z}_2$ stabilizers:

\[ V_n /\!/ G = V_n /\!/ G_1. \]

Note that any diagonal matrix sequence $\mathcal{A} \in D := \{\mathcal{A} \in V_n : b = c = 0\}$ has the subgroup

\[ H := \left\{ \begin{pmatrix} \lambda & 0 \\ 0 & \lambda^{-1} \end{pmatrix} : \lambda \in \mathbb{F}^* \right\} \subset G_1 \tag{3.1} \]

contained in its stabilizer.



Proposition 3.5. A $2 \times 2$ matrix sequence is stable (for the $SL_2(\mathbb{F})$ conjugation action)
if and only if it is not triangularizable. A $2 \times 2$ matrix sequence is semisimple if and only
if it is either stable or diagonalizable.

Proof. This follows from general results (see Artin, [5]). For completeness, we include a
proof of this particular case, using the Hilbert-Mumford criterion (Theorem 3.3). For the
first statement, we can assume that $\mathcal{A}$ is upper triangular. Then, a simple computation
shows that the closure of the orbit of $\mathcal{A}$ under the subgroup $H \subset G_1 = SL_2(\mathbb{F})$ (Equation
(3.1)) intersects $D$. So, either $\mathcal{A}$ is in $D$ (and it is commutative and semisimple) and its
stabilizer contains $H$, or $\mathcal{A}$ is not commutative, $\mathcal{A} \notin D$, so $G \cdot \mathcal{A} = G_1 \cdot \mathcal{A}$ is not closed.
In either case, $\mathcal{A}$ is not stable. Conversely, let $\mathcal{A}$ be not stable for the action of $G_1$. By
elementary representation theory, any one parameter subgroup of $G_1$ is conjugated to

\[ \phi_n(\lambda) = \begin{pmatrix} \lambda^n & 0 \\ 0 & \lambda^{-n} \end{pmatrix}. \]

In other words, any 1PS can be written as $\phi = g^{-1} \phi_n g$, for some $g \in G_1$ and some $\phi_n$, so

\[ \lim_{\lambda \to 0} \phi_{\mathcal{A}}(\lambda) = \lim_{\lambda \to 0} \phi(\lambda) \cdot \mathcal{A} = g^{-1} \cdot \lim_{\lambda \to 0} \phi_n(\lambda) \cdot (g \cdot \mathcal{A}). \tag{3.2} \]

Writing $g \cdot \mathcal{A}$ as

\[ g \cdot \mathcal{A} = \begin{pmatrix} a(g) & b(g) \\ c(g) & d(g) \end{pmatrix} \]

we obtain

\[ \phi_n(\lambda) \cdot (g \cdot \mathcal{A}) = \begin{pmatrix} a(g) & b(g) \lambda^{2n} \\ c(g) \lambda^{-2n} & d(g) \end{pmatrix}. \]

By the Hilbert-Mumford criterion, the limit (3.2) exists for some 1PS, so we must have
$c(g) = 0$, for some $g \in G_1$. This means that $g \cdot \mathcal{A}$ is upper triangular, hence not irreducible.
The second statement is analogous. □
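
The scaling behaviour behind the limit (3.2) can be seen on a small example. The sketch below is illustrative only: the function name `act_by_one_ps` is hypothetical, the action is taken to be $g \cdot A = g A g^{-1}$, and exact rational arithmetic stands in for a general field. Conjugation by $\phi_n(\lambda) = \mathrm{diag}(\lambda^n, \lambda^{-n})$ multiplies the upper right entry by $\lambda^{2n}$ and the lower left entry by $\lambda^{-2n}$.

```python
from fractions import Fraction


def matmul(p, q):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(p[i][k] * q[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def act_by_one_ps(lam, n, A):
    """Return phi_n(lam) A phi_n(lam)^(-1), with phi_n(lam) = diag(lam^n, lam^-n)."""
    lam = Fraction(lam)
    g = [[lam ** n, 0], [0, lam ** (-n)]]
    g_inv = [[lam ** (-n), 0], [0, lam ** n]]
    return matmul(matmul(g, A), g_inv)


# For an upper triangular matrix the orbit approaches the diagonal part as lam -> 0.
upper = [[1, 2], [0, 4]]
for lam in (Fraction(1, 2), Fraction(1, 4), Fraction(1, 8)):
    print(lam, act_by_one_ps(lam, 1, upper))

# With a non-zero lower left entry, that entry grows like lam^(-2n) instead.
print(act_by_one_ps(Fraction(1, 2), 1, [[1, 2], [3, 4]]))
```

For $n > 0$, the limit as $\lambda \to 0$ can therefore exist only when the lower left entries vanish, which is the step used in the proof.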
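3.3. Similarity for non-commutative triangularizable sequences. To obtain a numerical
similarity criterion for non-semisimple sequences, we are reduced, by Proposition
3.5, to the case of triangularizable sequences. Here, we consider the non-commutative
triangularizable case.

Let $\mathsf{U} \subset V_n$ be the affine variety of upper triangular matrix sequences and let $\mathsf{K} \subset \mathsf{U}$ be
the subset of commutative sequences. For $n \ge 2$ and $\mathcal{A} \in \mathsf{U}$, let $P_{\mathcal{A}}$ denote the following
$2 \times n$ matrix, and $\delta_{jk} = \delta_{jk}(\mathcal{A})$ the corresponding $2 \times 2$ minors:

\[ P_{\mathcal{A}} = \begin{pmatrix} b_1 & \cdots & b_n \\ e_1 & \cdots & e_n \end{pmatrix}, \quad \delta_{jk} = b_j e_k - e_j b_k, \quad j, k \in \{1, ..., n\}. \]

According to Lemma 2.1, $\mathsf{K}$ is characterized as consisting of sequences $\mathcal{A}$ such that the
rank of $P_{\mathcal{A}}$ is $\le 1$ (it is 0 when all terms in $\mathcal{A}$ are scalars). Hence, when $\mathcal{A} \in \mathsf{U} \setminus \mathsf{K}$, $P_{\mathcal{A}}$
has rank 2 and defines an element $\pi_{\mathcal{A}}$ of the Grassmannian $G(2, n)$ of 2-planes in $\mathbb{F}^n$. It is
easy to see that the action of $G = GL_2(\mathbb{F})$, restricted to $\mathsf{U} \setminus \mathsf{K}$, preserves the plane $\pi_{\mathcal{A}}$:

Lemma 3.6. Let $g \in G$. If $\mathcal{A}$ and $\mathcal{A}' = g \cdot \mathcal{A}$ are both in $\mathsf{U} \setminus \mathsf{K}$, then $e = e'$ and $\pi_{\mathcal{A}} = \pi_{\mathcal{A}'}$.

Proof. We can assume that $\mathcal{A}' = g \cdot \mathcal{A}$ for some $g \in SL_2(\mathbb{F})$ and compute, for $\mathcal{A} \in \mathsf{U} \setminus \mathsf{K}$,

\[ g \cdot \mathcal{A} = \begin{pmatrix} x & z \\ y & w \end{pmatrix} \begin{pmatrix} a & b \\ 0 & d \end{pmatrix} \begin{pmatrix} w & -z \\ -y & x \end{pmatrix} = \begin{pmatrix} * & * \\ y(ew - by) & * \end{pmatrix}. \]

Since $P_{\mathcal{A}}$ has rank 2, $ew \ne by$ as vectors in $\mathbb{F}^n$. Thus, in order that $g \cdot \mathcal{A}$ be again in $\mathsf{U} \setminus \mathsf{K}$,
we need $y = 0$, which simplifies the formula above to

\[ g \cdot \mathcal{A} = \begin{pmatrix} x & z \\ 0 & w \end{pmatrix} \begin{pmatrix} a & b \\ 0 & d \end{pmatrix} \begin{pmatrix} w & -z \\ 0 & x \end{pmatrix} = \begin{pmatrix} a & x(bx - ez) \\ 0 & d \end{pmatrix}, \tag{3.3} \]

because $xw = \det g = 1$. Then, $e = e' = a - d$. Moreover, the upper right entry is
$b' = x(bx - ez) = x^2 b - xze$, a linear combination of $b$ and $e$. So $\pi_{\mathcal{A}} = \pi_{\mathcal{A}'}$, as asserted. □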

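The correspondence $\mathcal{A} \mapsto \pi_{\mathcal{A}}$ is therefore well defined on the $G$-orbits of non-commutative
triangularizable sequences. To make this more precise, consider the following algebraic
subvarieties of $V_n$. Let $\mathcal{U} = G \cdot \mathsf{U}$ be the variety of triangularizable matrix sequences
and let $\mathcal{K} \subset \mathcal{U}$ be the subvariety of commutative sequences. Note that $\mathcal{U}$ is indeed irreducible,
as the image in $V_n$ of the (irreducible) algebraic variety $G \times \mathsf{U}$ under the morphism
$(g, \mathcal{A}) \mapsto g \cdot \mathcal{A}$. The irreducibility of $\mathcal{K}$ was first noted in [Ge] (see [Gu] for a proof).

By the previous lemma, given a sequence $\mathcal{A} \in \mathcal{U} \setminus \mathcal{K}$, we can define $\pi_{\mathcal{A}} := \pi_{\mathcal{B}}$ using any
matrix sequence $\mathcal{B} \in G \cdot \mathcal{A} \cap \mathsf{U}$, that is, $\mathcal{B}$ is upper triangular and similar to $\mathcal{A}$ (even though
the matrix $P_{\mathcal{A}}$ is not defined). So, we can define a map $\psi : \mathcal{U} \setminus \mathcal{K} \to \mathbb{P}^{N-1}$, where $\mathbb{P}^{N-1}$
denotes the projective space over $\mathbb{F}$ of dimension $N - 1 = \binom{n}{2} - 1$, as the composition

\[ \mathcal{U} \setminus \mathcal{K} \xrightarrow{\ \pi\ } G(2, n) \hookrightarrow \mathbb{P}^{N-1}, \]

where the inclusion is the Plücker embedding of the Grassmannian $G(2, n)$. In more
concrete terms we have the following.

Lemma 3.7. Let $\mathcal{A} \in \mathcal{U} \setminus \mathcal{K}$. Then $\psi(\mathcal{A}) = [\delta_{12} : \delta_{13} : \cdots : \delta_{n-1,n}]$, where $\delta_{jk} = \delta_{jk}(\mathcal{B})$, for
any $\mathcal{B} \in G \cdot \mathcal{A} \cap \mathsf{U}$.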

Proof. The formula follows from the definition of the Plücker embedding. We just need to
show that the map is well defined. The point $[\delta_{12} : \delta_{13} : \cdots : \delta_{n-1,n}]$ is in projective space
since at least a pair of terms, say $A_j$ and $A_k$, do not commute, so that $\delta_{jk}(\mathcal{B}) = b_j e_k - e_j b_k$
is nonzero, by Lemma 2.1. On the other hand, for a different sequence $\mathcal{B}' \in G \cdot \mathcal{A} \cap \mathsf{U}$,
Equation (3.3) and $e = e'$ (Lemma 3.6) imply

\[ \delta_{jk}(\mathcal{B}') = b'_j e'_k - e'_j b'_k = x(b_j x - e_j z) e_k - e_j x(b_k x - e_k z) = x^2 \delta_{jk}(\mathcal{B}), \]

so the point $\psi(\mathcal{A}) \in \mathbb{P}^{N-1}$ is indeed independent of the choice of $\mathcal{B} \in G \cdot \mathcal{A} \cap \mathsf{U}$. □
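
The scaling rule for the minors can be illustrated with exact arithmetic. This is a sketch under stated assumptions, not the paper's own computation: the helper names are hypothetical, the action is $g \cdot A = g A g^{-1}$ with $g$ upper triangular of determinant one, and the sample sequence is arbitrary. It checks that conjugation keeps the sequence upper triangular and multiplies every $\delta_{jk}$ by $x^2$.

```python
from fractions import Fraction


def matmul(p, q):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(p[i][k] * q[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def conjugate(x, z, A):
    """Return g A g^(-1) for g = [[x, z], [0, 1/x]], which has determinant one."""
    x, z = Fraction(x), Fraction(z)
    g = [[x, z], [0, 1 / x]]
    g_inv = [[1 / x, -z], [0, x]]
    return matmul(matmul(g, A), g_inv)


def delta(Aj, Ak):
    """delta_jk = b_j e_k - e_j b_k, where e = a - d."""
    bj, ej = Aj[0][1], Aj[0][0] - Aj[1][1]
    bk, ek = Ak[0][1], Ak[0][0] - Ak[1][1]
    return bj * ek - ej * bk


def pluecker(seq):
    """List of minors delta_jk for j < k, in lexicographic order."""
    n = len(seq)
    return [delta(seq[j], seq[k]) for j in range(n) for k in range(j + 1, n)]


B = [[[1, 2], [0, 3]], [[0, 1], [0, 5]], [[4, -1], [0, 2]]]
x, z = 2, 3
B_prime = [conjugate(x, z, A) for A in B]
print(pluecker(B))        # minors of the original sequence
print(pluecker(B_prime))  # the same minors multiplied by x^2 = 4
```

Since all coordinates are rescaled by the same non-zero factor, the projective point $[\delta_{12} : \cdots : \delta_{n-1,n}]$ is unchanged.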

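By Lemma 3.7, the quotients $\delta_{jk}/\delta_{lm}$ (for $j \ne k$ and $l \ne m$) are well defined rational
$G$-invariant functions on $\mathcal{U} \setminus \mathcal{K}$ (defined on the open dense complement of $\delta_{lm}^{-1}(0) \subset \mathcal{U} \setminus \mathcal{K}$), so
they descend to the quotient space $(\mathcal{U} \setminus \mathcal{K})/G$. Note, however, that these are not quotients
of regular (polynomial) invariants.

Define now the map $\Psi' : (\mathcal{U} \setminus \mathcal{K})/G \to \mathbb{F}^n \times \mathbb{F}^{N+n} \times \mathbb{P}^{N-1}$, $N = \binom{n}{2}$, by

\[ \Psi'([\mathcal{A}]) = \left( \{t_j\}_{j=1,...,n}, \{t_{jk}\}_{j,k=1,...,n}, \psi(\mathcal{A}) \right), \]

where $[\mathcal{A}]$ denotes the conjugacy class of $\mathcal{A}$, $t_j = \mathrm{tr}(A_j)$, $t_{jk} = \mathrm{tr}(A_j A_k)$, and $\psi$ was given
by Lemma 3.7. The main result of this subsection is the following. For the proof, we use a
standard consequence of the Noether-Deuring theorem; namely, if $\mathcal{A}$ and $\mathcal{A}'$ are elements
in $V_{m,n}(\mathbb{F})$ and $g \in GL_m(\bar{\mathbb{F}})$ verifies $g \cdot \mathcal{A} = \mathcal{A}'$ (i.e., similarity over $\bar{\mathbb{F}}$) then they are similar
over $\mathbb{F}$ (see e.g. [CR] p. 200).

Theorem 3.8. Let $n \ge 2$. The map $\Psi'$ is two-to-one. More precisely, the rational invariants
$\delta_{jk}/\delta_{lm}$, together with the regular invariants $t_j$ and $t_{jk}$, distinguish $G$-orbits in $\mathcal{U} \setminus \mathcal{K}$,
except for the identification $e \leftrightarrow -e$, when written in triangular form.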

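Proof. Let both sequences $\mathcal{A}$ and $\mathcal{A}'$ be triangularizable and non-commutative, so there
exist $\mathcal{B} \in G \cdot \mathcal{A} \cap (\mathsf{U} \setminus \mathsf{K})$ and $\mathcal{B}' \in G \cdot \mathcal{A}' \cap (\mathsf{U} \setminus \mathsf{K})$. Let us write $t_j = \mathrm{tr}(B_j) = \mathrm{tr}(A_j)$
etc., as before, and use primed letters to denote corresponding quantities for $\mathcal{A}'$ or $\mathcal{B}'$. To
prove injectivity, suppose $\Psi'([\mathcal{A}]) = \Psi'([\mathcal{A}'])$, so that $t_j = t'_j$ and $t_{jk} = t'_{jk}$ for all appropriate
indices. Then,

\[ e_j e_k = 2 t_{jk} - t_j t_k = 2 t'_{jk} - t'_j t'_k = e'_j e'_k. \]

Since any triple $(x^2, xy, y^2) \in \mathbb{F}^3$ determines $(x, y)$ up to sign, the equation above implies
that $e' = \pm e$, as vectors in $\mathbb{F}^n$. Suppose $e' = e$. The hypothesis $\Psi'([\mathcal{A}]) = \Psi'([\mathcal{A}'])$ means
also that $\psi(\mathcal{B}) = \psi(\mathcal{B}')$. Thus $\pi_{\mathcal{B}} = \pi_{\mathcal{B}'}$ as planes in $G(2, n)$ due to the injectivity of the
Plücker map. Therefore $b' = \alpha b + \beta e$ for some scalars $\alpha, \beta$ with $\alpha \ne 0$. So, using
the invertible matrix

\[ g = \begin{pmatrix} \sqrt{\alpha} & -\beta/\sqrt{\alpha} \\ 0 & 1/\sqrt{\alpha} \end{pmatrix}, \]

where $\sqrt{\alpha}$ is any square root of $\alpha$ in $\bar{\mathbb{F}}$, it is easy to see from Equation (3.3) that $g \cdot \mathcal{B} = \mathcal{B}'$, so that $\mathcal{A}$ and $\mathcal{A}'$
are similar over $\bar{\mathbb{F}}$. So, $\mathcal{A}$ and $\mathcal{A}'$ are also similar over $\mathbb{F}$. Finally, with $e' = -e$ (note $e \ne 0$
by hypothesis), one can see that $\mathcal{B}'$ is not similar to $\mathcal{B}$, hence the result. □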
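
The two-to-one phenomenon can be made concrete: exchanging the diagonal entries of every term of an upper triangular sequence (so $e \mapsto -e$, with $b$ unchanged) leaves all the invariants entering the map unchanged. The sketch below uses hypothetical helper names and represents an upper triangular matrix by the triple $(a, b, d)$; the projective point is normalized by dividing by its first non-zero coordinate, which is a choice made here for comparison purposes only.

```python
from fractions import Fraction


def invariants(seq):
    """Traces t_j, traces t_jk of products, and normalized minors delta_jk.

    seq is a list of upper triangular matrices given as triples (a, b, d).
    """
    n = len(seq)
    t = [a + d for a, b, d in seq]
    # For upper triangular matrices, tr(A_j A_k) = a_j a_k + d_j d_k.
    tt = {(j, k): seq[j][0] * seq[k][0] + seq[j][2] * seq[k][2]
          for j in range(n) for k in range(j, n)}
    deltas = [seq[j][1] * (seq[k][0] - seq[k][2])
              - (seq[j][0] - seq[j][2]) * seq[k][1]
              for j in range(n) for k in range(j + 1, n)]
    pivot = next(d for d in deltas if d != 0)  # exists for non-commutative input
    psi = [Fraction(d, pivot) for d in deltas]
    return t, tt, psi


def swap_diagonal(seq):
    """Exchange a and d in every term, which replaces e by -e."""
    return [(d, b, a) for a, b, d in seq]


B = [(1, 0, 3), (2, 1, 5), (0, 4, 1)]
print(invariants(B))
print(invariants(swap_diagonal(B)))
```

Both calls print the same data, although, by the theorem, the two sequences are not similar.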

In view of Proposition 3.5, Theorem 3.8 and Proposition 3.4 give a solution to the
invariants problem (ii) for non-commutative sequences. Theorem 1.14 provides a more
efficient solution (and works for any characteristic $\ne 2$), and its proof will be given after
exploring the canonical forms described in the next section.

Remark 3.9. Finding an analogous map in the commutative case is more involved! See 
Friedland, [E] for a discussion of the case of a commuting pair of 2 x 2 matrices. However, 
testing similarity of commutative 2x2 matrix sequences is a trivial task after reduction 
to triangular form, as recalled in Appendix B. 

We end this section with the following conjecture on the generalization of Proposition
1.10 to $m \times m$ matrices. Since an irreducible sequence is a generic sequence and the
conjugacy classes of triangularizable or block-triangularizable sequences seem to depend
on less data (i.e., fewer regular or rational invariants) than the irreducible case, we propose
the following problem. Define the "semisimple similarity number" $S(m)$ as

\[ S(m) = \min\{k \in \mathbb{N} : \forall n \in \mathbb{N},\ \forall \mathcal{A}, \mathcal{B} \in \mathcal{S}_{m,n},\ \mathcal{A} \sim \mathcal{B} \Leftrightarrow \mathcal{A}_J \sim \mathcal{B}_J\ \forall J \text{ with } |J| \le k\}, \]

where $\mathcal{S}_{m,n} \subset V_{m,n}$ is the subset of semisimple sequences.

Conjecture. For arbitrary $m \times m$ sequences (not necessarily semisimple), $\mathcal{A} \sim \mathcal{B}$ if and
only if $\mathcal{A}_J \sim \mathcal{B}_J$ for all index vectors $J$ with length $\le S(m)$.

Note that, by Procesi's Theorem, $S(m) \le N(m)$. This conjecture is true for $m = 2$, by
Proposition 1.10, since Proposition 2.12 gives $S(2) = 3$.

4. Canonical Forms and Reconstruction of Sequences 

4.1. Canonical forms. We now describe canonical forms for $2 \times 2$ matrix sequences over
an algebraically closed field $\mathbb{F}$. Informally, this means indicating, for each given matrix
sequence $\mathcal{A} \in V_n$, an element in its conjugacy class which has a simple form, where the
same simple form should be used for the biggest possible set of sequences, hence the term
'canonical'. We will assume $n \ge 2$ since such canonical forms for $n = 1$ are provided by
the well known Jordan decomposition. Also, for commutative sequences, the canonical
forms are the same as for $n = 1$. This is recalled in Appendix B. For non-commutative
sequences, it turns out that canonical forms can be divided into 5 cases.

By the preceding results, it is no surprise that we need to consider distinct canonical 
forms for the stable and for the reducible cases. We make the following choices. 

Definition 4.1. We say that a stable (i.e., irreducible) matrix sequence $\mathcal{A} = (A_1, A_2, ...)$
is in (stable) canonical form if $A_1$ is in Jordan canonical form and $b_2 = 1$. We say that a
triangularizable sequence $\mathcal{A}$ is in (triangular) canonical form if $\mathcal{A}$ is upper triangular, $A_1$
is diagonal and $b_2 = 1$.

Recall that, by Lemma 2.19, a triangularizable sequence of length $n \ge 2$ has at least
$n - 1$ semisimple (or diagonalizable) terms. By contrast, a stable sequence can have all
of its terms non-diagonalizable. To see this, just consider the family of matrices of
the form

\[ \begin{pmatrix} -\alpha\beta & \alpha^2 \\ -\beta^2 & \alpha\beta \end{pmatrix} \]

for some parameters $\alpha$ and $\beta$ not both zero. Then, all these matrices are similar to the
$2 \times 2$ Jordan block with zero diagonal (i.e., $\alpha = 1$, $\beta = 0$), but the $\sigma$ of two of these is zero
only when the vectors $(\alpha, \beta)$ are colinear. We have, however, the following fact.
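
The claim about this nilpotent family can be tested directly. The sketch below assumes that $\sigma(A, B) = \det(AB - BA)$, which agrees with the formula $\sigma_{12} = e_1^2 b_2 c_2$ used for diagonal $A_1$ in this section; the function names are illustrative. A short computation, not taken from the paper, suggests the closed form $\sigma = -(\alpha\beta' - \beta\alpha')^4$ for the members with parameters $(\alpha, \beta)$ and $(\alpha', \beta')$, and the code compares both sides on examples.

```python
def matmul(p, q):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(p[i][k] * q[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def nilpotent(alpha, beta):
    """Member of the family with parameters (alpha, beta); trace and determinant are 0."""
    return [[-alpha * beta, alpha * alpha], [-beta * beta, alpha * beta]]


def sigma(A, B):
    """sigma(A, B) = det(AB - BA), the convention assumed in this sketch."""
    AB, BA = matmul(A, B), matmul(B, A)
    C = [[AB[i][j] - BA[i][j] for j in range(2)] for i in range(2)]
    return C[0][0] * C[1][1] - C[0][1] * C[1][0]


def closed_form(p, q):
    """Candidate closed form -(alpha beta' - beta alpha')^4."""
    return -(p[0] * q[1] - p[1] * q[0]) ** 4


for p, q in [((1, 0), (0, 1)), ((2, 3), (1, -1)), ((1, 2), (2, 4))]:
    print(p, q, sigma(nilpotent(*p), nilpotent(*q)), closed_form(p, q))
```

The last pair is colinear and gives $\sigma = 0$, while the other two pairs give non-zero values.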

Proposition 4.2. Let $\mathcal{A}$ have reduced length $n = 2$ or $n \ge 4$. Then $\mathcal{A}$ is stable if and
only if some $\sigma_{jk} \ne 0$.

Proof. If $\mathcal{A}$ is not stable, then it is triangularizable, by Proposition 3.5, so that all $\sigma_{jk} = 0$,
by Proposition 2.10. Conversely, suppose $\mathcal{A}$ is stable with reduced length $n = 2$. Then
$\sigma_{12} \ne 0$ because of Friedland's result (or Theorem 1.11). Finally, if $n \ge 4$, Proposition
2.20 implies that at least one $\sigma_{jk} \ne 0$. On the other hand, Example 2.11 shows that the
statement is not true for $n = 3$. □



Theorem 4.3. All non-commutative matrix sequences can be put in canonical form. More
precisely, after rearranging terms, any sequence is similar to a sequence with $A_1$, $A_2$ and
$A_3$ as described by the following table.

    Case   Condition                                                          Canonical form
    ----   ---------                                                          --------------
    stable case
    1.a    $\sigma_{12} \ne 0$, $\delta_1 \ne 0$                              $A_1$ diagonal, $b_2 = 1$
    1.b    $\sigma_{12} \ne 0$, $\delta_1 = \delta_2 = 0$                     $A_1$ Jordan block, $b_2 = 1$
    1.c    $\sigma_{12} = \sigma_{13} = \sigma_{23} = 0$, $\Delta_{123} \ne 0$   $A_1$ diagonal, $b_2 = 1$
    triangularizable case
    2.a    all diagonalizable                                                 $A_1$ diagonal, $b_2 = 1$
    2.b    1 non-diagonalizable ($A_2$)                                       $A_1$ diagonal, $A_2$ Jordan block



Proof. Let us show that only the five possibilities above occur, and may be put in the
given forms, starting when $\mathcal{A}$ is stable. If $n \ge 4$ or $n = 2$, Proposition 4.2 implies that
there is some $\sigma_{jk} \ne 0$, so we can rearrange the terms of $\mathcal{A}$ so that we have $\sigma_{12} \ne 0$ (in
particular, $[A_1, A_2] \ne 0$). If $A_1$ or $A_2$ is diagonalizable, we have possibility 1.a. Assuming
$A_1$ diagonalizable, we may suppose that $A_1$ is already in diagonal form. Since $A_1$ is
nonscalar, $\delta_1 \ne 0$, and the stabilizer of $A_1$ is the group $H \subset G$ of diagonal invertible matrices.
Then $\sigma_{12} = e_1^2 b_2 c_2 \ne 0$ implies that both $b_2$ and $c_2$ are nonzero. Let $g = \mathrm{diag}(x, x^{-1})$ for
some $x \in \mathbb{F}^*$. A simple computation shows that $g^{-1} A_1 g$ is diagonal and $A'_2 = g^{-1} A_2 g$ has
$b'_2 = x^{-2} b_2$. So, we can solve for $x$ in order to have $b'_2 = 1$, as claimed. If $A_1$ and $A_2$ are
both not diagonalizable, we have case 1.b. In this case, we may suppose $A_1$ is already in
Jordan form. Conjugating $A_2$ with a matrix of the form

\[ g = \begin{pmatrix} 1 & z \\ 0 & 1 \end{pmatrix}, \]

a simple computation shows that we can solve for $z$ in order to obtain $b_2 = 1$. If $n = 3$ and
some $\sigma_{jk} \ne 0$, we return to cases 1.a or 1.b. So it remains the case $n = 3$ and all $\sigma_{jk} = 0$,
which is 1.c. From Example 2.11 we see that necessarily $\Delta_{123} \ne 0$ (after an eventual
rearrangement of terms), and a diagonal conjugation will achieve $b_2 = 1$. Finally, when
$\mathcal{A}$ is triangularizable, by Lemma 2.19 either none or one of the terms of $\mathcal{A}$ is
non-semisimple, cases 2.a and 2.b respectively. The forms mentioned will be obtained by
conjugation with a diagonal invertible matrix. □

In the above table we have $[A_1, A_2] \ne 0$ in all cases, so if $g$ stabilizes $(A_1, A_2)$, then $g$ is
scalar, by Lemma 3.1. Therefore, we have a uniqueness statement.

Proposition 4.4. Let $\mathcal{A}, \mathcal{B}$ be two sequences in canonical form with $[A_1, A_2] \ne 0$. Assume
that $(B_1, B_2) = (A_1, A_2)$. Then, $\mathcal{A} \sim \mathcal{B}$ if and only if $\mathcal{A} = \mathcal{B}$.

4.2. Reconstruction of sequences from invariants. We continue to work over an
algebraically closed field $\mathbb{F}$, and here we restrict to characteristic $\ne 2$. Let $\mathcal{S}'(\mathbb{F})$ denote
the subset of $V_n(\mathbb{F})$ of semisimple sequences such that $A_1$ is diagonalizable and $[A_1, A_2] \ne 0$.
Then $\mathcal{S}'(\mathbb{F})$ consists of irreducible (stable) sequences that can be put in the form 1.a. Let
$\Phi : \mathcal{S}'(\mathbb{F})/G(\mathbb{F}) \to \mathbb{F}^{4n-3}$ denote the map

\[ \Phi([\mathcal{A}]) := (t_1, t_{11}, t_2, t_{22}, t_{12}, ..., t_k, t_{1k}, t_{2k}, s_{12k}, ..., t_n, t_{1n}, t_{2n}, s_{12n}). \tag{4.1} \]

We now describe a process of (re)constructing a sequence from its values under the map
$\Phi$. Let $v \in \mathbb{F}^{4n-3}$ be given. We want to find $\mathcal{A} \in \mathcal{S}'(\mathbb{F})$ such that $\Phi([\mathcal{A}]) = v$.

Let $a_1$ and $d_1$ (in $\mathbb{F}$) be the roots of the polynomial $\lambda^2 - t_1 \lambda + \frac{t_1^2 - t_{11}}{2} = 0$ (the characteristic
polynomial of a diagonal matrix whose trace is $t_1$ and the trace of its square is $t_{11}$). If they
are equal, there is no solution to our problem simply because an $\mathcal{A}$ satisfying $\Phi([\mathcal{A}]) = v$
will have either $A_1$ non-diagonalizable or $[A_1, A_2] = 0$. So, with $e_1 = a_1 - d_1 \ne 0$, put
$b_1 = c_1 = 0$, $b_2 = 1$ and



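\[ c_2 = \frac{t_{22} - a_2^2 - d_2^2}{2}, \qquad \begin{pmatrix} a_2 \\ d_2 \end{pmatrix} = \begin{pmatrix} 1 & 1 \\ a_1 & d_1 \end{pmatrix}^{-1} \begin{pmatrix} t_2 \\ t_{12} \end{pmatrix}. \tag{4.2} \]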



Then, one easily checks that the pair $(A_1, A_2) \in \mathcal{S}'(\mathbb{F})$ whose entries are $a, b, c, d \in \mathbb{F}^2$ is in
canonical form 1.a and satisfies $\Phi([\mathcal{A}]) = (t_1, t_{11}, t_2, t_{22}, t_{12})$. Moreover, this pair is unique
except for the choice of assigning to $a_1$ or $d_1$ one or the other root of the characteristic
polynomial. The two possible choices are:

\[ \left( \begin{pmatrix} a_1 & 0 \\ 0 & d_1 \end{pmatrix}, \begin{pmatrix} a_2 & 1 \\ c_2 & d_2 \end{pmatrix} \right), \qquad \left( \begin{pmatrix} d_1 & 0 \\ 0 & a_1 \end{pmatrix}, \begin{pmatrix} d_2 & 1 \\ c_2 & a_2 \end{pmatrix} \right). \]

These pairs are similar (for $e_1$ and $c_2$ both non-zero), with similarity matrix

\[ g = \begin{pmatrix} 0 & 1/x \\ x & 0 \end{pmatrix}, \quad x = \sqrt{c_2}. \tag{4.3} \]

Moreover, up to a non-zero scalar multiple, this is the unique matrix sending one pair to
the other. Let $\mathcal{A}^g := g \cdot \mathcal{A}$ denote the sequence obtained by acting with this matrix. Note
that $[\mathcal{A}] = [\mathcal{A}^g]$ and if $\mathcal{A}$ is in canonical form, then so is $\mathcal{A}^g$. Now, let



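\[ \begin{pmatrix} t_k \\ t_{1k} \\ t_{2k} \\ s_{12k} \end{pmatrix} = \begin{pmatrix} 1 & 0 & 0 & 1 \\ a_1 & 0 & 0 & d_1 \\ a_2 & c_2 & 1 & d_2 \\ 0 & -c_2 e_1 & e_1 & 0 \end{pmatrix} \begin{pmatrix} a_k \\ b_k \\ c_k \\ d_k \end{pmatrix}, \quad k = 3, ..., n. \tag{4.4} \]

The determinant of this matrix is $-2 e_1^2 c_2$, so our hypotheses imply that it is indeed invertible.
Hence the transformation above provides an isomorphism of vector spaces, from the
variables $(a_k, b_k, c_k, d_k)$ to the variables $(t_k, t_{1k}, t_{2k}, s_{12k})$, $k = 3, ..., n$.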
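
The linear system (4.4) can be inverted in closed form, which gives a small reconstruction routine. The following is a sketch under stated assumptions rather than the paper's own procedure: it takes $A_1 = \mathrm{diag}(a_1, d_1)$ and $b_2 = 1$ (canonical form 1.a), uses the convention $s_{12k} = \mathrm{tr}(A_1 A_2 A_k) - \mathrm{tr}(A_2 A_1 A_k)$, which matches the last row of the matrix in (4.4), and works over the rationals with hypothetical function names.

```python
from fractions import Fraction


def matmul(p, q):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(p[i][k] * q[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def trace(m):
    return m[0][0] + m[1][1]


def invariants(A1, A2, Ak):
    """Return (t_k, t_1k, t_2k, s_12k) for the term Ak."""
    s = trace(matmul(matmul(A1, A2), Ak)) - trace(matmul(matmul(A2, A1), Ak))
    return (trace(Ak), trace(matmul(A1, Ak)), trace(matmul(A2, Ak)), s)


def reconstruct(A1, A2, inv):
    """Solve (4.4) for Ak; needs e1 != 0, c2 != 0 and characteristic != 2."""
    a1, d1 = A1[0][0], A1[1][1]
    a2, c2, d2 = A2[0][0], A2[1][0], A2[1][1]
    tk, t1k, t2k, s = (Fraction(v) for v in inv)
    e1 = a1 - d1
    ak = (t1k - d1 * tk) / e1
    dk = tk - ak
    r = t2k - a2 * ak - d2 * dk   # equals c2*bk + ck (third row of (4.4))
    q = s / e1                    # equals ck - c2*bk (fourth row of (4.4))
    return [[ak, (r - q) / (2 * c2)], [(r + q) / 2, dk]]


def det(m):
    """Determinant by Laplace expansion along the first row."""
    if len(m) == 1:
        return m[0][0]
    return sum((-1) ** j * m[0][j] * det([r[:j] + r[j + 1:] for r in m[1:]])
               for j in range(len(m)))


def matrix_44(A1, A2):
    """Coefficient matrix of (4.4)."""
    a1, d1 = A1[0][0], A1[1][1]
    a2, c2, d2 = A2[0][0], A2[1][0], A2[1][1]
    e1 = a1 - d1
    return [[1, 0, 0, 1], [a1, 0, 0, d1], [a2, c2, 1, d2], [0, -c2 * e1, e1, 0]]


A1 = [[3, 0], [0, 1]]
A2 = [[2, 1], [4, 5]]
Ak = [[7, -2], [3, 6]]
inv = invariants(A1, A2, Ak)
print(inv, reconstruct(A1, A2, inv), det(matrix_44(A1, A2)))
```

In this example $e_1 = 2$ and $c_2 = 4$, so the determinant is $-2 e_1^2 c_2 = -32$, and `reconstruct` recovers the entries of `Ak` from its four invariants.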

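Now, consider the reconstruction of triangularizable sequences. Let $\mathcal{U}'(\mathbb{F})$ denote the
subset of $V_n(\mathbb{F})$ of triangularizable sequences such that $A_1$ is diagonalizable and $[A_1, A_2] \ne 0$
(so that $\mathcal{U}'(\mathbb{F}) \subset \mathcal{U} \setminus \mathcal{K}$). Define the map $\Psi : \mathcal{U}'(\mathbb{F})/G \to \mathbb{F}^{2n} \times \mathbb{P}^{n-2}$, where $\mathbb{P}^k$ denotes
the projective space over $\mathbb{F}$ of dimension $k$, by the formula

\[ \Psi([\mathcal{A}]) = (t_1, t_{11}, ..., t_k, t_{1k}, ..., t_n, t_{1n}; \psi'([\mathcal{A}])), \]

where $\psi'([\mathcal{A}]) = [1 : \delta_{13} : \cdots : \delta_{1n}] \in \mathbb{P}^{n-2}$, $\delta_{jk} := b_j e_k - e_j b_k$, assuming $\mathcal{A}$ is in upper
triangular form. Let $w \in \mathbb{F}^{2n} \times \mathbb{P}^{n-2}$ be given. Again, let $a_1$ and $d_1$ be the (distinct) roots
of the characteristic polynomial $\lambda^2 - t_1 \lambda + \frac{t_1^2 - t_{11}}{2} = 0$. Then, with $e_1 = a_1 - d_1 \ne 0$, put
$b_1 = 0$, $b_2 = 1$, $c_j = 0$, $j = 1, ..., n$, and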


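\[ \begin{pmatrix} a_k \\ d_k \end{pmatrix} = \begin{pmatrix} 1 & 1 \\ a_1 & d_1 \end{pmatrix}^{-1} \begin{pmatrix} t_k \\ t_{1k} \end{pmatrix}, \qquad b_k = -\frac{\delta_{1k}}{e_1}, \qquad k = 3, ..., n. \]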



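Then the matrix sequence $\mathcal{A}$ with the entries $a, b, c, d \in \mathbb{F}^n$ is in triangular canonical form
(2.a or 2.b) and satisfies $\Psi([\mathcal{A}]) = w$, with $\psi'(\mathcal{A}) = [1 : \delta_{13} : \cdots : \delta_{1n}] \in \mathbb{P}^{n-2}$. The other
solution $\mathcal{A}'$ is obtained by changing $e$ to $-e$ (that is, exchanging $a$ with $d$). We have the following.

Proposition 4.5. The map $\Phi : \mathcal{S}'(\mathbb{F})/G(\mathbb{F}) \to \mathbb{F}^{4n-3}$ is injective, and $\Psi : \mathcal{U}'(\mathbb{F})/G(\mathbb{F}) \to
\mathbb{F}^{2n} \times \mathbb{P}^{n-2}$ is two-to-one.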



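Proof. Let $\mathcal{A}_c = (A_1, A_2, ...)$ and $\mathcal{B}_c = (B_1, B_2, ...)$ be canonical forms associated to $\mathcal{A}, \mathcal{B} \in
\mathcal{S}'(\mathbb{F})$ respectively. If $\Phi([\mathcal{A}]) = \Phi([\mathcal{B}])$ then either $(A_1, A_2) = (B_1, B_2)$ or $(A_1, A_2) =
(B_1, B_2)^g$. If the first situation holds, then $\mathcal{A}_c = \mathcal{B}_c$ by the isomorphism given by Equation
(4.4). Hence, Proposition 4.4 shows that $\mathcal{A} \sim \mathcal{B}$. In the second alternative, just replace
$\mathcal{B}_c$ with $\mathcal{B}_c^g$ (using $g$ in Equation (4.3)) and the conclusion is the same, showing that $\Phi$ is
injective. The case of $\Psi$ is analogous, although in this case the two possibilities for a pair
in triangular canonical form with given values of $(t_1, t_{11}, t_2, t_{22})$ are:

\[ \left( \begin{pmatrix} a_1 & 0 \\ 0 & d_1 \end{pmatrix}, \begin{pmatrix} a_2 & 1 \\ 0 & d_2 \end{pmatrix} \right), \qquad \left( \begin{pmatrix} d_1 & 0 \\ 0 & a_1 \end{pmatrix}, \begin{pmatrix} d_2 & 1 \\ 0 & a_2 \end{pmatrix} \right). \]

These pairs are not similar (note $e_1 \ne 0$), by Lemma 3.6, resulting in $\Psi$ being two-to-one. □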



Now, the proof of Theorem 1.14 is easy.

Proof of Theorem 1.14. Let $\mathbb{F}$ be a field of characteristic $\ne 2$ and $\bar{\mathbb{F}}$ its algebraic closure. We
have shown that $\Phi : \mathcal{S}'(\bar{\mathbb{F}})/G(\bar{\mathbb{F}}) \to \bar{\mathbb{F}}^{4n-3}$ is injective. Then, the map $\Phi' : \mathcal{S}'(\mathbb{F})/G(\mathbb{F}) \to
\mathbb{F}^{4n-3}$ defined in Equation (1.6), having the same form as $\Phi$, is just its restriction to
$\mathcal{S}'(\mathbb{F})/G(\mathbb{F})$, being therefore injective as well. The inclusion $\mathcal{S}'(\mathbb{F})/G(\mathbb{F}) \subset \mathcal{S}'(\bar{\mathbb{F}})/G(\bar{\mathbb{F}})$
reflects the fact that two sequences in $\mathcal{S}'(\mathbb{F})$, similar over $G(\bar{\mathbb{F}})$, are also similar over $G(\mathbb{F})$,
by the Noether-Deuring theorem (see e.g. [CR] p. 200). The case of $\Psi$ is analogous. □

Remark 4.6. Finally, we argue that the other cases of semisimple non-commutative
sequences in the table of Theorem 4.3 can also be reconstructed by a simple modification
of the above procedure. In the case 1.b, we substitute the pair $(A_1, A_2)$ by the pair
$(A_1 - A_2, A_1 + A_2)$, and in the case 1.c (assuming, without loss of generality, that $A_1$ is
diagonal non-scalar), we perform the substitution $(A_1, A_2, A_3) \mapsto (A_1, A_2 + A_3, A_2 - A_3)$.
In both cases we end up with a sequence in $\mathcal{S}'$, that is, the first matrix is diagonalizable
non-scalar and the first pair does not commute.

In conclusion, the only conjugacy classes that the maps $\Phi'$ and $\Psi'$ fail to distinguish
(not counting the involution $e \leftrightarrow -e$ in the $\Psi'$ case, and after the reduction to $\mathcal{S}'$ described
above) are the following types:

which have the same value under $\Phi'$ (note that these are not in the domain of $\Psi'$, as they
are commutative), regardless of $b \in \mathbb{F}^n$.

Appendix A. Triangularization of singlets over R 

In this Appendix, we treat the triangularization problem for a single $2 \times 2$ matrix over
an integral domain $R$. We do not claim originality of Proposition A.1 and Lemma A.2
below, although the author was unable to find a suitable reference. Their exposition is
mainly intended to provide self-contained proofs of Theorems 1.11 and 2.14.

When is a single matrix with entries in $R$ triangularizable? Let us write

\[ A = \begin{pmatrix} a & b \\ c & d \end{pmatrix} \in V_1(R), \quad g = \begin{pmatrix} x & z \\ y & w \end{pmatrix} \in GL_2(R). \tag{A.1} \]

Since $g$ is invertible, so is $\det g = xw - yz$, and conjugation of $A$ by $g^{-1}$ gives:

\[ g^{-1} \cdot A = g^{-1} A g = \frac{1}{xw - yz} \begin{pmatrix} * & bw^2 + ezw - cz^2 \\ cx^2 - exy - by^2 & * \end{pmatrix}. \tag{A.2} \]

So, triangularizing $A$ amounts to finding a solution $(x, y, z, w) \in R^4$ to one of the equations
$cx^2 - exy - by^2 = 0$ or $bw^2 + ezw - cz^2 = 0$, such that $xw - yz$ is invertible in $R$. Note
that both equations are given by the same quadratic form

\[ Q(x, y) := cx^2 - exy - by^2 = \tfrac{1}{2} (x, y)\, Q_A\, (x, y)^T \]

associated to the matrix

\[ Q_A = \begin{pmatrix} 2c & -e \\ -e & -2b \end{pmatrix}. \]

The discriminant of $Q$ is $\delta_Q := -\det Q_A = e^2 + 4bc = \mathrm{tr}^2 A - 4 \det A$, and coincides precisely
with the discriminant $\delta_A$ of the characteristic polynomial of $A$.
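
Formula (A.2) and the discriminant identity can be verified on integer examples. The sketch below avoids division by computing $\det(g)\, g^{-1} A g$ through the adjugate of $g$; the function names are illustrative and the sample matrices are arbitrary, with $g$ having columns $(x, y)$ and $(z, w)$ as in (A.1).

```python
def matmul(p, q):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(p[i][k] * q[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def scaled_conjugate(A, g):
    """Return det(g) * g^(-1) A g, using the adjugate of g = [[x, z], [y, w]]."""
    (x, z), (y, w) = g
    adj = [[w, -z], [-y, x]]
    return matmul(matmul(adj, A), g)


def Q(A, x, y):
    """Quadratic form Q(x, y) = c x^2 - e x y - b y^2, with e = a - d."""
    (a, b), (c, d) = A
    return c * x * x - (a - d) * x * y - b * y * y


def discriminants(A):
    """Return (e^2 + 4bc, tr(A)^2 - 4 det(A)), which should coincide."""
    (a, b), (c, d) = A
    return ((a - d) ** 2 + 4 * b * c, (a + d) ** 2 - 4 * (a * d - b * c))


A = [[2, 3], [5, -1]]
g = [[1, 4], [2, 7]]           # columns (x, y) = (1, 2) and (z, w) = (4, 7)
C = scaled_conjugate(A, g)
print(C)                        # lower left entry is Q(x, y), upper right is -Q(z, w)
print(Q(A, 1, 2), -Q(A, 4, 7))
print(discriminants(A))
```

The upper right entry $bw^2 + ezw - cz^2$ equals $-Q(z, w)$, which is the sense in which both off-diagonal entries are governed by the same quadratic form.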

Equation (A.2) provides a necessary condition for triangularization: if $A$ is triangularizable
over $R$, then its eigenvalues lie in $R$. Indeed, if $g^{-1} A g = T$ for some $g \in GL_2(R)$ and
some upper triangular matrix $T$, then $\delta_A = \delta_T = (\lambda_1 - \lambda_2)^2$ is a square in $R$, where $\lambda_1, \lambda_2 \in R$
are the diagonal elements of $T$, which are also the eigenvalues of $A$.

A necessary and sufficient condition is the following.

Proposition A.1. Let $A \in V_1(R)$ be a $2 \times 2$ matrix over an integral domain $R$. Then $A$ is
triangularizable if and only if it has an eigenvector of the form $(x, y) \in R^2$, such that
$xR + yR = R$ (in particular, the ideal $(x, y)$ is principal).

Proof. The equation $Ag = gT$, for some invertible $g \in G(R)$ and upper triangular $T$,
means that the first column of $g$ is an eigenvector $(x, y)$ of $A$, so that $x, y \in R$ and there
exist $w, z \in R$ (forming the second column of $g$) so that $xw - yz$ is a unit. In particular,
$xR + yR = R$. Conversely, let $A$ be as in (A.1), with an eigenvector $(x, y) \in R^2$ verifying
$xR + yR = R$. From simple computations the eigenvalues of $A$ (both in $R$) are $\lambda_1 = \frac{a + d + r}{2}$
and $\lambda_2 = \frac{a + d - r}{2}$, where $r$ is a square root of $\delta_A = e^2 + 4bc$, and the respective eigenvectors
are $v_1 = (\lambda_1 - d, c)$ and $v_2 = (\lambda_2 - d, c)$ (both in $R^2$). So, without loss of generality, let
$(x, y)$ be in the eigenspace of $v_1$: $(x, y) = \alpha(\lambda_1 - d, c) = \alpha\left(\frac{e + r}{2}, c\right)$, for some nonzero $\alpha$
in the field of fractions of $R$. By hypothesis, there exist $z, w \in R$ verifying $xw - yz = 1$.
Thus, using the invertible matrix $g$ with columns $(x, y)$ and $(z, w)$, we have that $g^{-1} A g$
is upper triangular (with $\lambda_1$ and $\lambda_2$ in the main diagonal). Indeed, by Equation (A.2) its
lower left entry is $cx^2 - exy - by^2 = \frac{\alpha^2}{4} c \, (r^2 - e^2 - 4bc) = 0$. □
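
The construction in this proof can be sketched in code for the special case $R = \mathbb{Z}$. This is an illustration under assumptions, not the paper's algorithm: over $\mathbb{Z}$ any nonzero integer eigenvector can be divided by the gcd of its entries to obtain $xR + yR = R$, and the extended Euclidean algorithm then supplies $z, w$ with $xw - yz = 1$. All function names are hypothetical.

```python
from math import gcd, isqrt

def matmul(p, q):
    """Product of two 2x2 matrices given as nested tuples."""
    return tuple(
        tuple(sum(p[i][k] * q[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )

def extended_gcd(p, q):
    """Return (u, s, t) with s*p + t*q == u and |u| == gcd(p, q)."""
    if q == 0:
        return (p, 1, 0)
    u, s, t = extended_gcd(q, p % q)
    return (u, t, s - (p // q) * t)

def triangularize_over_Z(A):
    """Return (g, T) with det g = 1 and T = g^{-1} A g upper triangular,
    or None when delta_A is not a perfect square."""
    (a, b), (c, d) = A
    delta = (a - d) ** 2 + 4 * b * c
    if delta < 0 or isqrt(delta) ** 2 != delta:
        return None
    lam = (a + d + isqrt(delta)) // 2      # an integer eigenvalue of A
    # Pick a nonzero vector in the kernel of A - lam*I.
    if (a - lam, b) != (0, 0):
        x, y = b, lam - a
    elif (c, d - lam) != (0, 0):
        x, y = lam - d, c
    else:                                  # A is scalar
        x, y = 1, 0
    k = gcd(x, y)
    x, y = x // k, y // k                  # now x*Z + y*Z = Z
    u, s, t = extended_gcd(x, y)           # s*x + t*y == u, with u = 1 or -1
    w, z = s * u, -t * u                   # hence x*w - y*z == 1
    g = ((x, z), (y, w))
    g_inv = ((w, -z), (-y, x))             # inverse of g, since det g = 1
    return g, matmul(matmul(g_inv, A), g)
```

For the matrix with rows $(1, 2)$ and $(3, 2)$, the eigenvector found is $(2, 3)$ with eigenvalue $4$, and the resulting $T$ has diagonal entries $4$ and $-1$.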

The following fact is used in the proof of Theorem 2.14.

Lemma A.2. Fix nonzero elements $x, y$ in a field $F$. If $A \in V_1(F)$ is nondegenerate ($\delta_A \neq 0$)
and verifies $cx^2 - exy - by^2 = 0$, then $A$ is diagonalizable and one of its eigenvectors
is $(x, y)$. In particular, there is another eigenvector $(z, w) \in F^2$ with $wx - yz \neq 0$.

Proof. To satisfy $cx^2 - exy - by^2 = 0$, the triple $(b, e, c)$ must be a linear combination of
the vectors $(-x, y, 0)$ and $(0, x, y)$. So, the matrix $A$ is given (uniquely up to the addition
of a scalar) by the triple $(b, e, c) = (-zx, zy + wx, wy)$, for some $z, w \in F$. Then, a simple
computation shows that the discriminant of $A$ is a square in $F$: $\delta_A = (wx - zy)^2$. So $A$
is triangularizable over $F$. Moreover, its eigenvalues are easily checked to be $\lambda_1 = wx$ and
$\lambda_2 = yz$. Since, by hypothesis, $\delta_A \neq 0$, the eigenvalues are distinct, so $A$ is diagonalizable
over $F$. Also, the eigenvectors are multiples of $(x, y)$ and $(z, w)$, respectively. □
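
The parametrization used in this proof can be checked on a concrete instance. The sketch below uses integer sample values (viewed inside $\mathbb{Q}$) and an explicit scalar shift `d`, so the eigenvalues appear as $wx + d$ and $yz + d$; the names `matrix_from_vectors` and `apply` are illustrative only.

```python
def matrix_from_vectors(x, y, z, w, d=0):
    """Build A = ((a, b), (c, d)) with (b, e, c) = (-z*x, z*y + w*x, w*y),
    where e = a - d and d is the free scalar shift."""
    b, e, c = -z * x, z * y + w * x, w * y
    return ((d + e, b), (c, d))

def apply(A, v):
    """Apply the 2x2 matrix A to the column vector v."""
    (a, b), (c, d) = A
    return (a * v[0] + b * v[1], c * v[0] + d * v[1])

x, y, z, w, d = 2, 3, 5, 7, 4              # sample values with w*x != z*y
A = matrix_from_vectors(x, y, z, w, d)
(a, b), (c, _) = A
e = a - d

form_value = c * x * x - e * x * y - b * y * y   # should vanish
delta = e * e + 4 * b * c                        # should equal (w*x - z*y)^2
image_xy = apply(A, (x, y))                      # (w*x + d) * (x, y)
image_zw = apply(A, (z, w))                      # (y*z + d) * (z, w)
```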

Appendix B. Canonical Forms and Similarity of Commuting Sequences 

For completeness, we include the following well known description of all canonical forms 
of commuting sequences over an algebraically closed field F. 

Theorem B.1. A matrix sequence of length $n$ with coefficients in $F$ is commutative if and
only if it is similar to a matrix sequence in one of the following forms, called diagonal and triangular
forms, respectively (both forms include the scalar sequences):

$A = \begin{pmatrix} a & 0 \\ 0 & d \end{pmatrix}, \qquad B = \begin{pmatrix} a & b \\ 0 & a \end{pmatrix}, \qquad a, d, b \in F^n.$

Proof. The sufficiency is clear by Lemma 2.1 (condition (2.2) is satisfied because $b = 0$
(resp. $e = 0$) in the first (resp. second) case). Conversely, since $A$ is commutative it
is triangularizable, by Corollary 2.2. Let $k \geq 1$ be the smallest integer such that $A_k$ is
non-scalar. By an appropriate conjugation, one can assume that either $A_k$ is diagonal or it
is written as a single Jordan block (here we are using the assumption on $F$). Then Lemma
2.1 implies that $A$ is either in diagonal or in triangular form, as wanted. □

The following consequence is clear. 

Corollary B.2. Let $A$ and $A'$ be commutative, both of the same form as described in the
Theorem above. If $A$ is in diagonal form, then $A \sim A'$ if and only if $a = a'$ and $d = d'$, or
$a = d'$ and $d = a'$. If $B$ is in triangular form, then $B \sim B'$ if and only if $a = a'$ and $b = \lambda b'$
for some $\lambda \in F \setminus \{0\}$.
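
The "if" directions of this corollary come from two explicit conjugations, which the following sketch illustrates over $\mathbb{Q}$ (exact arithmetic via `Fraction`): a diagonal matrix $\operatorname{diag}(\lambda, 1)$ rescales $b$ in the triangular form, and the permutation matrix swaps $a$ and $d$ in the diagonal form. The helper names and sample sequences are assumptions made for illustration.

```python
from fractions import Fraction

def matmul(p, q):
    """Product of two 2x2 matrices given as nested tuples."""
    return tuple(
        tuple(sum(p[i][k] * q[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )

def conjugate(seq, g):
    """Conjugate every matrix of the sequence: A_k -> g^{-1} A_k g."""
    (p, q), (r, s) = g
    det = Fraction(p * s - q * r)
    g_inv = ((s / det, -q / det), (-r / det, p / det))
    return [matmul(matmul(g_inv, A), g) for A in seq]

# Triangular form with a = (1, 2) and b = (3, 6); conjugating by diag(3, 1)
# divides every b_k by 3 and leaves a unchanged.
triangular = [((1, 3), (0, 1)), ((2, 6), (0, 2))]
rescaled = conjugate(triangular, ((3, 0), (0, 1)))

# Diagonal form with a = (1, 5) and d = (2, 7); conjugating by the
# permutation matrix exchanges the roles of a and d.
diagonal = [((1, 0), (0, 2)), ((5, 0), (0, 7))]
swapped = conjugate(diagonal, ((0, 1), (1, 0)))
```

The same single $\lambda$ acts on all entries of $b$ at once, which is why the corollary asks for $b = \lambda b'$ as vectors rather than entry by entry.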